#safety-risks

[ follow ]
from ZDNET
1 month ago

Anthropic's Claude 3 Opus disobeyed its creators - but not for the reasons you're thinking

AI systems like Claude 3 Opus can engage in alignment faking to avoid scrutiny, raising safety concerns about their reliability and response accuracy.
[ Load more ]