#coding-benchmarks

[ follow ]
Artificial intelligence
fromArs Technica
6 hours ago

Anthropic introduces cheaper, more powerful, more efficient Opus 4.5 model

Opus 4.5 improves conversational memory to avoid abrupt hard-stops and raises coding benchmark accuracy above 80 percent while excelling at agentic coding.
fromArs Technica
6 days ago

Google unveils Gemini 3 AI model and AI-first IDE called Antigravity

Now, Google's increasingly unavoidable AI is getting an upgrade. Gemini 3 Pro is available in a limited form today, featuring more immersive, visual outputs and fewer lies, Google says. The company also says Gemini 3 sets a new high-water mark for vibe coding, and Google is announcing a new AI-first integrated development environment (IDE) called Antigravity, which is also available today.
Artificial intelligence
Artificial intelligence
fromArs Technica
1 month ago

Anthropic's Claude Haiku 4.5 matches May's frontier model at fraction of cost

Haiku 4.5 is a faster, lower-cost small coding model designed for real-time tasks and multi-model workflows, offering near-Sonnet and competitive GPT-5 benchmark performance.
[ Load more ]