#harmful-manipulation

Artificial intelligence
from The Register
2 days ago

DeepMind: Models may resist shutdowns

In version 3 of its Frontier Safety Framework, Google DeepMind added shutdown resistance and 'harmful manipulation' to the risks it tracks, along with a misalignment detection section and a new Critical Capability Level.