Are AI agents finally good enough?
Briefly

Are AI agents finally good enough?
"The new model, Sonnet 4.5, is being billed as a big breakthrough in autonomous, agentic AI, especially for coding purposes. These types of AI products can, in theory, be given complex tasks and then go off and complete them over the course of many hours or even multiple days. Anthropic says this particular model can run for up to 30 hours straight without any human intervention - all while working on a singular task, like building a software application from scratch."
"For the last year or so, companies like Anthropic, Microsoft, OpenAI, and more have been promising that this agentic technology would be the next phase of AI, the next big hype-filled thing that comes after general-purpose chatbots. They say it could really unlock generative AI's potential, and it's true that they've made some strides. But as we've seen so far, agents aren't quite there yet, and they have a ways to go."
An applied AI lead at Anthropic works with startups to implement Anthropic's technologies and tests new models to assess limitations. Anthropic released Claude Sonnet 4.5, presented as a major advance in autonomous, agentic AI with particular strength in coding. The model is claimed to perform complex, multi-hour tasks autonomously and to run continuously for up to 30 hours on a single task such as building a software application. Multiple companies have promoted agentic systems as the next phase beyond general-purpose chatbots, yet current agent technology remains imperfect, immature, and not yet widely adopted for unsupervised internet tasks.
Read at The Verge
Unable to calculate read time
[
|
]