
"In an effort to probe the limits of autonomous software development Anthropic researcher Nicholas Carlini used sixteen Claude Opus 4.6 AI agents to build a Rust-based C compiler from scratch. Working in parallel on a shared repository, the agents coordinated their changes and ultimately produced a compiler capable of building the Linux 6.9 kernel across x86, ARM, and RISC-V, as well as many other open-source projects. The agents ran roughly 2,000 sessions without human intervention, incurring about $20,000 in API costs."
"He paired this setup with multiple Claude instances running in parallel, each inside its own Docker container but accessing a shared Git repo. This increased efficiency, allowing Claude to tackle multiple tasks at once, and encouraging agent specialization, with some agents handling documentation, others generated code quality, and so on. To synchronize agents, Carlini relied on a simple lock-based scheme:"
"If you ask for a solution to a long and complex problem, the model may solve part of it, but eventually it will stop and wait for continued input-a question, a status update, or a request for clarification. Carlini's approach consisted in "sticking Claude in a simple loop", so that the agent keeps working on a given task until it's perfect, then it immediately moves to the next."
Sixteen Claude Opus 4.6 AI agents built a Rust-based C compiler from scratch and produced a compiler capable of building the Linux 6.9 kernel across x86, ARM, and RISC-V. Agents worked in parallel on a shared Git repository inside separate Docker containers and coordinated changes via a lock-based task file system. The team ran roughly 2,000 sessions without human intervention, costing about $20,000 in API calls. Agents specialized into roles such as documentation and code-quality generation. A looping mechanism kept agents working continuously on tasks until completion, enabling long-running autonomous progress and parallel task throughput.
Read at InfoQ
Unable to calculate read time
Collection
[
|
...
]