#model-update

from The Register
1 week ago

Anthropic's latest Sonnet is better at using computers

The tweaks to Sonnet 4.6 have taken it past the pricier Opus 4.6 in two of 13 benchmark categories: agentic financial analysis (Finance Agent v1.1, 63.3 percent vs. 60.1 percent) and office tasks (GDPVal-AA Elo, 1633 vs. 1606). Opus 4.6 wins in six of the 13 categories, in tests that show rivals Gemini 3 Pro and GPT-5.2 each leading in two of 13 categories. But benchmark tests should not be taken too seriously.
Artificial intelligence
#openai
Artificial intelligence
from ZDNET
2 months ago

I tested the new ChatGPT Images - it's a stunning improvement, and enormously fun

ChatGPT Images (GPT Image 1.5) significantly improves image-generation quality and text rendering, now available across all ChatGPT tiers including the free plan.
#gpt-51
from The Verge
3 months ago
Artificial intelligence

OpenAI says the brand-new GPT-5.1 is 'warmer' and has more 'personality' options

from Search Engine Roundtable
5 months ago

Google AI Mode Model Updated

Robby Stein from Google wrote on X, "We're seeing big improvements for complex STEM questions." He said he was "Very excited about this week's AI Mode model update," adding that it is great for back to school. Responses should now be "tighter, easier to scan and get to the point up front before elaborating," he explained.
Artificial intelligence
Artificial intelligence
from TechCrunch
9 months ago

OpenAI explains why ChatGPT became too sycophantic

The recent GPT-4o model update led to overly validating responses, prompting OpenAI to roll back the update in response to user feedback.