Apple
from InfoQ
1 week ago
Apple Improves Context Window Management for its Foundation Models
iOS 26.4 enhances context window management for Apple's Foundation Models, enabling developers to optimize usage within the 4096-token limit.
The tweaks to Sonnet 4.6 have taken it past the pricier Opus 4.6 in two of 13 benchmark categories: agentic financial analysis (Finance Agent v1.1, 63.3 percent vs. 60.1 percent) and office tasks (GDPVal-AA Elo, 1633 vs. 1606). Opus 4.6 wins in six of the 13 categories, in tests that show rivals Gemini 3 Pro and GPT-5.2 each leading in two of the 13 categories. But benchmark tests should not be taken too seriously.
Software development today suffers from a plague that we are gradually coming to accept. It goes something like this: you're stuck on some code, so you paste a snippet into your favorite AI tool, hoping to debug it. You might get a solution, but then the tool introduces a new bug, which you now have to spend time debugging.
Aaron Levie told Business Insider there's a catch to AI agents: if you feed them too much information, they start to lose the plot. The CEO of the cloud-storage giant Box calls this problem "context rot." The more data you give the AI model, "it doesn't necessarily lead to a better outcome," Levie said on Tuesday. "The model will just get very confused and potentially focus on the wrong part of the information," he said. As the task drags on, the model can lose track of what it's supposed to focus on, leading to worse results, he added.