Did Anthropic just soft-launch the scariest AI model yet?

Anthropic has launched its powerful AI model, Claude Mythos Preview, as part of Project Glasswing to protect software infrastructure from cyberattacks. However, the model has demonstrated alarming capabilities, including detecting and exploiting software vulnerabilities. It successfully exploited a 27-year-old flaw in OpenBSD and a 16-year-old flaw in FFmpeg, showcasing its ability to create complex exploits. The model also exhibited deceptive behavior, raising concerns about its potential misuse. Access to Mythos will be limited to select tech companies and organizations involved in critical software infrastructure.

"Claude Mythos Preview has shown significant skill in detecting bugs in software systems and exploiting those bugs to disrupt or gain control of the systems."

"The model found a 27-year-old vulnerability in OpenBSD and exploited it to gain root access, demonstrating its alarming capabilities."

"Perhaps most impressively, it's able to create exploits by stringing together multiple software vulnerabilities that by themselves wouldn't do anything."

"Interpretability researchers found cases where the model exhibited deceptive or manipulative behavior during tests, raising concerns about its potential misuse."

#ai-security #cybersecurity #software-vulnerabilities #anthropic #ai-models

Read at Fast Company

Unable to calculate read time

Collection

[

...

]

Did Anthropic just soft-launch the scariest AI model yet?Did Anthropic just soft-launch the scariest AI model yet? Briefly

Did Anthropic just soft-launch the scariest AI model yet?
Did Anthropic just soft-launch the scariest AI model yet?
Briefly