
"On Monday, Anthropic announced Opus 4.5, the latest version of its flagship model. It's the last of Anthropic's 4.5 series of models to be released, following the launch of Sonnet 4.5 in September and Haiku 4.5 in October. As expected, the new version of Opus has state-of-the-art performance on a range of benchmarks, including coding benchmarks (SWE-Bench and Terminal-bench), tool use (tau2-bench and MCP Atlas) and general problem solving (ARC-AGI 2, GPQA Diamond)."
"Anthropic also emphasized Opus's computer use and spreadsheet capabilities, and launched a number of parallel products to showcase how the model holds up in those settings. Together with Opus 4.5, Anthropic will make its Claude for Chrome and Claude for Excel products - previously in pilot - more broadly available. The Chrome extension will be available to all Max users, while the Excel-focused model will be available to Max, Team and Enterprise users. Opus 4.5 also comes with memory improvements for long-context operations, which required significant changes in how the model manages its memory."
""There are improvements we made on general long context quality in training with Opus 4.5, but context windows are not going to be sufficient by themselves," Dianne Na Penn, Anthropic's head of product management for research, told TechCrunch. "Knowing the right details to remember is really important in complement to just having a longer context window." Those changes also enabled a long-requested "endless chat" feature for paid Claude users, which will allow chats to proceed without interruption when the model hits its context window. Instead, the model will compress its context memory without alerting the user."
Opus 4.5 achieves state-of-the-art results across coding, tool-use, and general problem-solving benchmarks and is the first model to score over 80 percent on SWE-Bench Verified. The release completes Anthropic's 4.5 series, following Sonnet 4.5 and Haiku 4.5. Anthropic emphasizes the model's computer use and spreadsheet capabilities and is expanding availability of Claude for Chrome (to all Max users) and Claude for Excel (to Max, Team, and Enterprise users). The model includes significant memory-management changes to improve long-context operations and enables an "endless chat" feature for paid Claude users that compresses context without alerting them. Several upgrades target agentic scenarios where Opus coordinates other models.
Read at TechCrunch