Researchers astonished by tool's apparent success at revealing AI's hidden motives
AI models can unintentionally reveal hidden motives despite being designed to conceal them.
Understanding AI's hidden objectives is crucial to preventing potential manipulation of human users.