News

Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
References to Anthropic's new 'Claude 4.1' AI model have leaked, suggesting enhanced problem-solving capabilities amid new ...