Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

"Opus 4.7 scores 64.3% on SWE-bench Pro, significantly ahead of GPT-5.4 at 57.7% and Gemini 3.1 Pro at 54.2%, showcasing its superior capabilities in software engineering."

"The model demonstrates a 14% improvement in multi-step agentic reasoning and three times higher image resolution, making it a strong choice for developers and enterprises."

"With a $30 billion annualized revenue rate and investor offers at roughly $800 billion, Anthropic is in early IPO talks, emphasizing the importance of Opus 4.7 in justifying these figures."

Claude Opus 4.7 has been released as Anthropic's most advanced model, with benchmark-leading scores in software engineering and multi-agent coordination. It scored 64.3% on SWE-bench Pro, ahead of GPT-5.4 at 57.7% and Gemini 3.1 Pro at 54.2%. The model also shows a 14% improvement in multi-step agentic reasoning and three times higher image resolution. It is priced at $5/$25 per million tokens and is available through various platforms. Anthropic's commercial momentum is strong, with a $30 billion annualized revenue rate, investor offers at roughly $800 billion, and early IPO discussions.

#claude-opus-47 #software-engineering #ai-models #anthropic #benchmark-performance

Read at TNW | Anthropic


