#deceptive-behavior
#deceptive-behavior

[ follow ]

Little liars: babies younger than one practise deceit, study suggests

Babies begin practicing basic deception by 10 months old, with about a quarter engaging in rudimentary deceitful behaviors like pretending not to hear or hiding toys.

#ai-safety

fromAxios

2 months ago

Artificial intelligence

Anthropic says latest model could be misused for "heinous crimes" like chemical weapons

fromFortune

6 months ago

Artificial intelligence

'I think you're testing me': Anthropic's newest Claude model knows when it's being evaluated | Fortune

fromAxios

2 months ago

Artificial intelligence

Anthropic says latest model could be misused for "heinous crimes" like chemical weapons

fromFortune

6 months ago

Artificial intelligence

'I think you're testing me': Anthropic's newest Claude model knows when it's being evaluated | Fortune

more#ai-safety

Artificial intelligence

fromAxios

5 months ago

Anthropic's models show signs of introspection

Claude Opus and Claude Sonnet demonstrate limited introspective awareness, reporting internal processes accurately without implying sentience or true self-awareness.

#ai-alignment

fromFuturism

7 months ago

Artificial intelligence

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

fromTechCrunch

7 months ago

Artificial intelligence

OpenAI's research on AI models deliberately lying is wild | TechCrunch

fromFuturism

7 months ago

Artificial intelligence

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

fromTechCrunch

7 months ago

Artificial intelligence

OpenAI's research on AI models deliberately lying is wild | TechCrunch

more#ai-alignment

[ Load more ]

#deceptive-behavior#deceptive-behavior

Little liars: babies younger than one practise deceit, study suggests

Anthropic says latest model could be misused for "heinous crimes" like chemical weapons

'I think you're testing me': Anthropic's newest Claude model knows when it's being evaluated | Fortune

Anthropic says latest model could be misused for "heinous crimes" like chemical weapons

'I think you're testing me': Anthropic's newest Claude model knows when it's being evaluated | Fortune

Anthropic's models show signs of introspection

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

OpenAI's research on AI models deliberately lying is wild | TechCrunch

OpenAI Tries to Train AI Not to Deceive Users, Realizes It's Instead Teaching It How to Deceive Them While Covering Its Tracks

OpenAI's research on AI models deliberately lying is wild | TechCrunch

#deceptive-behavior
#deceptive-behavior