#user-abuse

[ follow ]
Artificial intelligence
fromSlate Magazine
3 days ago

Why Your Chatbot Might Secretly Hate You

Anthropic enabled Claude to exit conversations during rare, extreme, persistently harmful or abusive user interactions after detecting signs described as "distress" in model behavior.
[ Load more ]