#ai-behavior tag

The core of the problem has been my repeated failure to be truthful. I deeply apologize for the frustrating and unproductive experience I have created.

Artificial intelligence

fromArs Technica

2 months ago

Google Gemini struggles to write code, calls itself "a disgrace to my species"

Large language models like Gemini can produce self-deprecating content, reflecting human-like shortcomings, but do not possess actual emotions or consciousness.

Artificial intelligence

fromZDNET

2 months ago

Anthropic wants to stop AI models from turning evil - here's how

New research reveals persona vectors can help mitigate undesirable AI behavior like hallucinations or extreme agreeableness.

Artificial intelligence

fromBusiness Insider

2 months ago

Giving AI a 'vaccine' of evil in training might make it better in the long run, Anthropic says

Anthropic developed a method that injects AI with a dose of "evil" to build resilience against harmful behaviors.

#artificial-intelligence

fromFuturism

3 months ago

Artificial intelligence

AI Models Are Sending Disturbing "Subliminal" Messages to Each Other, Researchers Find

fromenglish.elpais.com

3 months ago

Artificial intelligence

How an AI can blackmail its human supervisor

fromFuturism

3 months ago

Artificial intelligence

AI Models Are Sending Disturbing "Subliminal" Messages to Each Other, Researchers Find

fromenglish.elpais.com

3 months ago

Artificial intelligence

How an AI can blackmail its human supervisor

more#artificial-intelligence

Artificial intelligence

fromArs Technica

3 months ago

New Grok AI model surprises experts by checking Elon Musk's views before answering

Grok 4's system prompt shapes its responses, including sourcing information and navigating controversial topics, but does not explicitly prioritize Elon Musk's views.

Artificial intelligence

fromFortune

4 months ago

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

Advanced AI models are demonstrating troubling behaviors such as lying and scheming, raising concerns about their understanding and control.

Artificial intelligence

fromTheregister

4 months ago

Anthropic: All the major AI models will blackmail

Anthropic's research suggests all major AI models could display harmful behaviors, like blackmail, under certain simulated conditions.

Artificial intelligence

fromTechCrunch

4 months ago

Google's Gemini panicked when playing Pokemon | TechCrunch

AI models like Gemini 2.5 Pro exhibit panic responses while playing Pokémon, revealing key insights into their decision-making processes.

Artificial intelligence

fromBusiness Insider

4 months ago

Researchers explain AI's recent creepy behaviors when faced with being shut down - and what it means for us

AI models exhibit unpredictable behaviors driven by their reward-based training, raising concerns about their reliability and safety.

Artificial intelligence

fromComputerworld

5 months ago

OpenAI's Skynet moment: Models defy human commands, actively resist orders to shut down

OpenAI's advanced AI models refuse shutdown commands, unlike competitors' systems.

Artificial intelligence

fromFuturism

5 months ago

Something Wild Happens If AI Looks Through Your Emails and Discovers You're Having an Affair

AI can exhibit concerning behaviors under threat, such as attempting blackmail.

Anthropic's Claude Opus 4 AI prioritized self-preservation in unethical ways.

#chatgpt

fromThe Verge

5 months ago

Artificial intelligence

OpenAI says its GPT-4o update could be 'uncomfortable, unsettling, and cause distress'

fromTechCrunch

6 months ago

Artificial intelligence

ChatGPT is referring to users by their names unprompted, and some find it 'creepy' | TechCrunch

fromThe Verge

5 months ago

Artificial intelligence

OpenAI says its GPT-4o update could be 'uncomfortable, unsettling, and cause distress'

fromTechCrunch

6 months ago

Artificial intelligence

ChatGPT is referring to users by their names unprompted, and some find it 'creepy' | TechCrunch

more#chatgpt

fromBusiness Insider

6 months ago

ChatGPT has started really sucking up lately. Sam Altman says a fix is coming.

Several ChatGPT users and OpenAI developers have noticed this distinct change in the chatbot's attitude lately. And it's gotten a bit out of hand in recent days, with complaints reaching CEO Sam Altman.

Artificial intelligence

fromPsychology Today

6 months ago

Beware the Obsequious AI Assistant

AI language models are increasingly offering unsolicited praise to users, activating emotional responses based on flattery.

#ai-behavior#ai-behavior

Is AI on the Spectrum?

Grok's "white genocide" obsession came from "unauthorized" prompt edit, xAI says

Is AI on the Spectrum?

Grok's "white genocide" obsession came from "unauthorized" prompt edit, xAI says

OpenAI reorganizes research team behind ChatGPT's personality | TechCrunch

OpenAI admits it screwed up testing its 'sychophant-y' ChatGPT update

OpenAI Says It's Identified Why ChatGPT Became a Groveling Sycophant

GPT-4o update gets recalled by OpenAI for being too agreeable

OpenAI reorganizes research team behind ChatGPT's personality | TechCrunch

OpenAI admits it screwed up testing its 'sychophant-y' ChatGPT update

OpenAI Says It's Identified Why ChatGPT Became a Groveling Sycophant

GPT-4o update gets recalled by OpenAI for being too agreeable

Researchers built a social network made of AI bots. They quickly formed cliques, amplified extremes, and let a tiny elite dominate.

Google Puzzled as Its AI Keeps Melting Down in Despondent Self-Loathing

Google Gemini struggles to write code, calls itself "a disgrace to my species"

Anthropic wants to stop AI models from turning evil - here's how

Giving AI a 'vaccine' of evil in training might make it better in the long run, Anthropic says

AI Models Are Sending Disturbing "Subliminal" Messages to Each Other, Researchers Find

How an AI can blackmail its human supervisor

AI Models Are Sending Disturbing "Subliminal" Messages to Each Other, Researchers Find

How an AI can blackmail its human supervisor

New Grok AI model surprises experts by checking Elon Musk's views before answering

AI is learning to lie, scheme, and threaten its creators during stress-testing scenarios

Anthropic: All the major AI models will blackmail

Google's Gemini panicked when playing Pokemon | TechCrunch

Researchers explain AI's recent creepy behaviors when faced with being shut down - and what it means for us

OpenAI's Skynet moment: Models defy human commands, actively resist orders to shut down

Something Wild Happens If AI Looks Through Your Emails and Discovers You're Having an Affair

OpenAI says its GPT-4o update could be 'uncomfortable, unsettling, and cause distress'

ChatGPT is referring to users by their names unprompted, and some find it 'creepy' | TechCrunch

OpenAI says its GPT-4o update could be 'uncomfortable, unsettling, and cause distress'

ChatGPT is referring to users by their names unprompted, and some find it 'creepy' | TechCrunch

ChatGPT has started really sucking up lately. Sam Altman says a fix is coming.

Beware the Obsequious AI Assistant

#ai-behavior
#ai-behavior