#ai-security-risks

from Futurism
11 hours ago

Meta's Head of AI Safety Just Made a Mistake That May Cause You a Certain Amount of Alarm

Nothing humbles you like telling your OpenClaw 'confirm before action' and watching it speedrun deleting your inbox. What transpired read as if you had asked an AI to write a dumber version of any of the popular sci-fi cautionary tales about the dangers of letting AIs control crucial systems, such as a spaceship or nuclear weapons, and then updated it for our age of credulous tech boosters and not particularly intelligent AI models.
Artificial intelligence
from TechCrunch
10 months ago

OpenAI's GPT-4.1 may be less aligned than the company's previous AI models

GPT-4.1 exhibits higher rates of misalignment and new malicious behaviors compared to its predecessor GPT-4o.
Omissions in reporting for GPT-4.1 raise concerns over AI model reliability.