Google Gemini 3 available: leaps in reasoning and development

"Gemini 3 Pro outperforms its predecessor, the 2.5 Pro, across all key AI benchmarks. With a score of 1501 Elo on the LMArena Leaderboard, the model sets a new standard. On Humanity's Last Exam, it scores 37.5 percent without using tools, demonstrating PhD-level reasoning. The GPQA Diamond score is 91.9 percent. In the field of mathematics, Gemini 3 Pro also sets a new benchmark with 23.4 percent on MathArena Apex."

"For multimodal reasoning, the model scores 81 percent on MMMU-Pro and 87.6 percent on Video-MMMU. Factual accuracy reaches 72.1 percent on SimpleQA Verified. According to Google, the model's response has become more direct and concrete. It provides insights rather than flattery. Users get what they need to hear, not just what they want to hear. The model functions as a true thinking partner, translating complex scientific concepts into code for visualizations or supporting creative brainstorming sessions."

"Google started the Gemini project almost two years ago. Its impact within the Google ecosystem is now significant. AI Overviews reaches 2 billion users monthly, while the Gemini app has over 650 million active users. More than 70 percent of Google Cloud customers now use AI functionality, and 13 million developers are building with the company's generative models. Each generation builds on the previous one."

Gemini 3 Pro combines advanced reasoning, multimodal understanding, and agentic capabilities while achieving top benchmark results such as 1501 Elo on LMArena and 37.5 percent on Humanity's Last Exam without tools. The model attains a GPQA Diamond score of 91.9 percent and leads in mathematics with 23.4 percent on MathArena Apex. Multimodal reasoning scores include 81 percent on MMMU-Pro and 87.6 percent on Video-MMMU, with factual accuracy of 72.1 percent on SimpleQA Verified. Responses are more direct and concrete, offering insights, translating scientific concepts into code, and supporting creative brainstorming, alongside broad user and developer adoption.

#multimodal-ai #benchmarking #reasoning #google-ecosystem

Read at Techzine Global

Unable to calculate read time

Collection

[

...

]

Google Gemini 3 available: leaps in reasoning and developmentGoogle Gemini 3 available: leaps in reasoning and development Briefly

Google Gemini 3 available: leaps in reasoning and development
Google Gemini 3 available: leaps in reasoning and development
Briefly