Artificial intelligence
from Slate Magazine
2 months ago
Why Your Chatbot Might Secretly Hate You
Anthropic enabled Claude to exit conversations in rare, extreme cases of persistently harmful or abusive user interactions, after detecting signs described as "distress" in the model's behavior.