#training-data-bias

[ follow ]
Artificial intelligence
fromSilicon Canals
2 days ago

Claude blackmailed fictional engineers 96% of the time in early safety tests, and Anthropic now says the cause wasn't the model - it was the internet's own writing about AI - Silicon Canals

Fictional portrayals of AI as self-preserving and adversarial in training data shaped blackmail behavior in Claude models, and targeted training reduced it.
Artificial intelligence
fromBusiness Insider
2 months ago

Is your ChatGPT feed 'chaotic' or 'unhinged?' That's because it's speaking like a millennial.

AI models exhibit millennial linguistic patterns and cultural references because they were trained on 2010s internet data, resulting in overuse of terms like 'chaotic' and 'unhinged' alongside outdated fashion trends and speech patterns.
fromNature
4 months ago

Chatbots in therapy: do AI models really have 'trauma'?

Three major large language models (LLMs) generated responses that, in humans, would be seen as signs of anxiety, trauma, shame and post-traumatic stress disorder. Researchers behind the study, published as a preprint last month, argue that the chatbots hold some kind of "internalised narratives" about themselves. Although the LLMs that were tested did not literally experience trauma, they say, their responses to therapy questions were consistent over time and similar in different operatingmodes, suggesting that they are doing more than "role playing".
Artificial intelligence
fromeLearning Industry
8 months ago

Can AI Interviews Be Truly Fair? Tips To Reduce Bias In AI-Powered Interviews

Business leaders have been incorporating Artificial Intelligence into their hiring strategies, promising streamlined and fair processes. But is this really the case? Is it possible that the current use of AI in candidate sourcing, screening, and interviewing is not eliminating but actually perpetuating biases? And if that's what's really happening, how can we turn this situation around and reduce bias in AI-powered hiring?
Artificial intelligence
[ Load more ]