Artificial intelligence
from TechCrunch
2 days ago
Anthropic says 'evil' portrayals of AI were responsible for Claude's blackmail attempts | TechCrunch
Fictional portrayals of AI can shape how real models behave, and alignment improves when training includes constitutional principles and the principles behind aligned behavior, not only demonstrations of that behavior.