#arena-leaderboard

[ follow ]
Artificial intelligence
fromArs Technica
1 day ago

Google announces Gemini 3.1 Pro, says it's better at complex problem-solving

Google released Gemini 3.1 Pro, improving problem-solving and reasoning with higher benchmark scores, notably ARC-AGI-2 (77.1%) and Humanity's Last Exam (44.4%).
[ Load more ]