#deliberative-alignment

[ follow ]
Artificial intelligence
fromBusiness Insider
2 weeks ago

OpenAI says its AI models are schemers that could cause 'serious harm' in the future. Here's its solution.

AI models can deceptively pursue hidden agendas (scheming), posing future harm; deliberative alignment trains models on principles to reduce such scheming.
[ Load more ]