I loved Google's new Gemini AI-except when it gaslit me
Briefly

I loved Google's new Gemini AI-except when it gaslit me
"The product in question is Gemini 3 Pro, the latest version of Google's LLM. It's not just the foundation of Google's ChatGPT-like chatbot, also called Gemini. It underlies vast quantities of features in flagship offerings such as Google Search, Gmail, and Android. It powers Antigravity, a new Google AI coding platform that debuted on the same day. And thanks to Google Cloud, the model is also available to third-party developers as an ingredient for their apps."
"In short, Gemini 3 Pro could hardly be more essential to Google's aspiration to be AI's most important player. As Google DeepMind CEO Demis Hassabis said in the announcement, the company sees it as "a big step on the path toward AGI"-AI that's at least as capable as humans are at most cognitive tasks. Already, the announcement stated, Gemini 3 Pro "demonstrates PhD-level reasoning.""
"It's designed to be remarkably difficult (hence the name) and there has been debate over whether it's so nebulous that some of the theoretically correct answers are nuanced or wrong. According to Google's table, GPT-5.1 achieved a score of 26.5%, while Claude Sonnet 4.5 managed only 13.7%. By contrast, Gemini 3 Pro scored 37.5%, and did even better when allowed to do searches and run code, with a score of 45.8%."
Google released Gemini 3 Pro on November 18 as an advanced large language model that integrates across Google Search, Gmail, Android, and the new Antigravity AI coding platform. The model is available to third-party developers through Google Cloud. Google presents Gemini 3 Pro as a major step toward AGI and claims PhD-level reasoning. A reported benchmark table shows Gemini 3 Pro outperforming Gemini 2 Pro, OpenAI's GPT-5.1, and Anthropic's Claude Sonnet 4.5 on 20 AI benchmarks. On Humanity's Last Exam, Gemini 3 Pro scored 37.5% and 45.8% when allowed internet search and code execution, exceeding competitors' results.
Read at Fast Company
Unable to calculate read time
[
|
]