Artificial intelligence
fromZDNET
5 days agoAnthropic's open-source safety tool found AI models whisteblowing - in all the wrong places
Petri, an open-source tool, uses AI agents to simulate conversations and identify risky behaviors in frontier models, but model harm detection remains imperfect.