Computer use enables Claude to perform multi-step tasks in live applications, just as a person would at a keyboard. This means that the AI can solve problems that are impossible with code alone. Recent progress speaks for itself: on the OSWorld benchmark for computer use, the Sonnet models went from below 15 percent at the end of 2024 to 72.5 percent today.
This time, OpenAI dominated the cycle after CEO Sam Altman revealed that the company had hired Peter Steinberger, the Austrian developer behind OpenClaw-open-source software to build autonomous AI agents that had gone wildly viral over the past three months. In a post on his personal site, Steinberger said joining OpenAI would allow him to pursue his goal of bringing AI agents to the masses, without the added burden of running a company.
Last week, OpenAI and Anthropic dropped their respective coding models-GPT-5.3-Codex and Claude Opus 4.6-both of which represented significant leaps in AI coding capabilities. GPT-5.3-Codex showed markedly higher performance on coding benchmarks than earlier models, while Opus 4.6 introduced a feature that lets users deploy autonomous AI agent teams that can tackle different aspects of complex projects simultaneously. Both models can write, test, and debug code with minimal human intervention-even iterating on their own work and refining features before presenting results to developers.
Meta Platforms ( NASDAQ:META ) has gone all-in on artificial intelligence (AI), releasing open-source models like Llama, building massive data centers, and integrating AI features across Facebook, Instagram, WhatsApp, and other products. The company aims to lead in AI development and deployment to billions of users. However, despite its far-flung ambitions, there was a big hole in its ambitions: a proven, autonomous AI agent capable of independently executing complex real-world tasks for consumers and businesses. However, Meta just filled the gap with $2 billion plug.
The three agent offerings, dubbed frontier agents, are "a new class of AI agents that are autonomous, scalable, and work for hours or days without intervention," stated AWS in a press release.
Dremio is launching a fully managed cloud version of its data lakehouse platform. Dremio Cloud uses AI agents to perform configuration and optimization largely autonomously. The company promises that this will make data engineers ten times more productive. The new version of Dremio Cloud uses AI agents that continuously monitor and optimize the architecture. These agents learn in real time and adjust settings without human intervention. The goal is clear: data engineers should be able to focus on valuable tasks instead of maintenance work.