
"Gemini 3 Pro scored 1501 Elo on the LMArena leaderboard, topping virtually every other LLM, including Claude, ChatGPT, and Grok. On the GPQA Diamond benchmark, which tests PhD-level scientific reasoning, it achieved 91.9%-better than Claude Sonnet 4.5 and ChatGPT 5.1. The model also scored 37.5% on Humanity's Last Exam without tools, surpassing GPT-5 Pro's previous high of 31.64%. In math, Gemini 3 set a new standard with 23.4% on MathArena Apex."
"What distinguishes this release is Google's emphasis on agentic capabilities-the model's ability to plan and execute multi-step tasks with reduced human intervention. Demis Hassabis, CEO of Google DeepMind, described Gemini 3 as evolving from "simply reading text and images to reading the room." The model combines what Google calls state-of-the-art reasoning with multimodal understanding, processing text, images, video, audio and code simultaneously."
Google released Gemini 3 and integrated it into Search at launch, making it available through the Gemini app, AI Studio, and Vertex AI. The rollout follows Gemini 2.5 by seven months and arrives shortly after GPT 5.1, underscoring rapid progress among leading AI firms. Benchmark results show top-tier performance across reasoning and math, including leading LMArena Elo and strong GPQA Diamond and Humanity's Last Exam scores. The model emphasizes agentic capabilities to plan and execute multi-step tasks with reduced human intervention. Gemini 3 supports multimodal inputs—text, images, video, audio, and code—and ships alongside a new Antigravity coding platform.
Read at Fortune
Unable to calculate read time
Collection
[
|
...
]