#ai-interpretability

[ follow ]
Artificial intelligence
fromFortune
1 day ago

Google DeepMind agrees to sweeping research collaboration with the U.K. government | Fortune

Google DeepMind and the U.K. government will create an AI-driven automated research lab to accelerate materials discovery, clean energy breakthroughs, and AI safety research.
Artificial intelligence
fromZDNET
4 months ago

Anthropic wants to stop AI models from turning evil - here's how

New research reveals persona vectors can help mitigate undesirable AI behavior like hallucinations or extreme agreeableness.
[ Load more ]