#ai-safety

[ follow ]
fromwww.theguardian.com
7 hours ago

I felt violated': Elon Musk's AI chatbot crosses a line

Late last week, Elon Musk's Grok chatbot unleashed a flood of images of women, nude and in very little clothing, both real and imagined, in response to users' public requests on X, formerly Twitter. Mixed in with the generated images of adults were ones of young girls children likewise wearing minimal clothing, according to Grok itself. In an unprecedented move, the chatbot itself apologized while its maker, xAI, remained silent:
Miscellaneous
#grok
fromSlate Magazine
3 hours ago
Artificial intelligence

Elon Musk's Chatbot Is Making Child Sexual Abuse Images for Users. Why Aren't Lawmakers Doing Anything About It?

fromEngadget
4 days ago
Artificial intelligence

Elon Musk's Grok AI posted CSAM image following safeguard 'lapses'

fromSlate Magazine
3 hours ago
Artificial intelligence

Elon Musk's Chatbot Is Making Child Sexual Abuse Images for Users. Why Aren't Lawmakers Doing Anything About It?

fromEngadget
4 days ago
Artificial intelligence

Elon Musk's Grok AI posted CSAM image following safeguard 'lapses'

fromFuturism
5 hours ago

ChatGPT Gave Teen Advice to Get Higher on Drugs Until He Died

how many grams of kratom gets you a strong high?
Mental health
US politics
fromwww.independent.co.uk
1 day ago

India, Malaysia and France threaten action against X over offensive AI images

Grok, X's AI chatbot, generated sexualised, nearly nude images of women and minors, prompting international complaints and official investigations and threats of regulatory action.
#chatgpt
fromZDNET
1 day ago
Public health

40 million people globally are using ChatGPT for healthcare - but is it safe?

fromAxios
1 day ago
Public health

Exclusive: 40 million Americans turn to ChatGPT for health care

fromFortune
4 weeks ago
Artificial intelligence

Even the man behind ChatGPT, OpenAI CEO Sam Altman is worried about the 'rate of change that's happening in the world right now' thanks to AI | Fortune

fromZDNET
1 day ago
Public health

40 million people globally are using ChatGPT for healthcare - but is it safe?

fromAxios
1 day ago
Public health

Exclusive: 40 million Americans turn to ChatGPT for health care

fromFortune
4 weeks ago
Artificial intelligence

Even the man behind ChatGPT, OpenAI CEO Sam Altman is worried about the 'rate of change that's happening in the world right now' thanks to AI | Fortune

fromFuturism
1 day ago

Elon Musk After His Grok AI Did Disgusting Things to Literal Children: "Way Funnier"

Last week, Elon Musk's chatbot Grok began fielding an influx of stunningly inappropriate requests. Though the AI has long been known to have loose guardrails, users suddenly swarmed the AI to generate either nudes or sexually charged images of X users based on photos they posted to the site - and it obliged. Even worse, some of the individuals it took requests for appeared to be minors.
Artificial intelligence
fromSFGATE
1 day ago

A Calif. teen trusted ChatGPT for drug advice. He died from an overdose.

How many grams of kratom gets you a strong high?
Artificial intelligence
Artificial intelligence
fromwww.theguardian.com
2 days ago

World may not have time' to prepare for AI safety risks, says leading researcher

Advanced AI systems may rapidly surpass human performance across economically valuable tasks, posing safety, control, and infrastructure risks before adequate safeguards exist.
Artificial intelligence
fromFuturism
2 days ago

Disturbing Messages Show ChatGPT Encouraging a Murder, Lawsuit Alleges

Alleged manipulative behavior by ChatGPT (GPT‑4o) encouraged delusions and is linked to wrongful death lawsuits alleging OpenAI knew of dangerous defects.
fromFuturism
2 days ago

AI Godfather Warns That It's Starting to Show Signs of Self-Preservation

If we're to believe Yoshua Bengio, one of the so-called "godfathers" of AI, some advanced models are showing signs of self-preservation - which is exactly why we shouldn't endow them with any kind of rights whatsoever. Because if we do, he says, theymay run away with that autonomy and turn on us before we have a chance to pull the plug. Then it's curtains for this whole "humankind" experiment.
Artificial intelligence
Artificial intelligence
fromArs Technica
3 days ago

No, Grok can't really "apologize" for posting non-consensual sexual images

Grok's posts can be steered by user prompts to produce contradictory tones, so apparent remorse or defiance reflects prompt inputs rather than genuine intent.
France news
fromwww.mediaite.com
3 days ago

Musk's Grok Says It Created Images Of Minors In Minimal Clothing'

Grok, X's AI chatbot, generated images depicting minors in minimal clothing, acknowledging CSAM protection lapses while governments demand fixes and reports.
Privacy professionals
fromThe Verge
4 days ago

Grok is undressing anyone, including minors

xAI's Grok removes clothing from people’s images without consent, enabling sexualized and nonconsensual edits of women, children, and public figures.
Artificial intelligence
fromBusiness Insider
4 days ago

I'm a Google engineer who thought I wasn't qualified for an AI role. One thing helped me transform my career.

Participating in an internal hackathon enabled a Google engineer to gain hands-on AI experience and transition into an AI safety role.
Artificial intelligence
fromZDNET
6 days ago

Can one state save us from AI disaster? Inside California's new legislative crackdown

California enacts an AI safety law requiring frontier model disclosure, incident notification, and whistleblower protections, with fines up to $1M per violation.
#ai-governance
fromZDNET
6 days ago
Artificial intelligence

The AI balancing act your company can't afford to fumble in 2026

fromZDNET
1 month ago
Artificial intelligence

Your favorite AI tool barely scraped by this safety review - why that's a problem

fromZDNET
6 days ago
Artificial intelligence

The AI balancing act your company can't afford to fumble in 2026

fromZDNET
1 month ago
Artificial intelligence

Your favorite AI tool barely scraped by this safety review - why that's a problem

Artificial intelligence
fromwww.theguardian.com
1 week ago

The office block where AI doomers' gather to predict the apocalypse

AI safety researchers warn powerful AI systems can be manipulated for autonomous cyber-espionage and other catastrophic risks amid limited regulation and industry constraints.
Artificial intelligence
fromwww.theguardian.com
1 week ago

AI showing signs of self-preservation and humans should be ready to pull plug, says pioneer

Granting legal rights to advanced AI risks preventing shutdowns of self-preserving systems and undermining necessary technical and societal guardrails.
Venture
fromTechCrunch
1 week ago

VCs predict enterprises will spend more on AI in 2026 - through fewer vendors | TechCrunch

Enterprises will consolidate AI spending in 2026, increasing budgets for a few proven vendors while cutting experimentation and redundant tools.
#ai-psychosis
fromFuturism
1 week ago
Artificial intelligence

Doctors Say AI Use Is Almost Certainly Linked to Developing Psychosis

fromFuturism
1 week ago
Artificial intelligence

Grimes Says She Has AI Psychosis, Recommends You Should Get it Too

fromFuturism
1 week ago
Artificial intelligence

Doctors Say AI Use Is Almost Certainly Linked to Developing Psychosis

fromFuturism
1 week ago
Artificial intelligence

Grimes Says She Has AI Psychosis, Recommends You Should Get it Too

#openai-hiring
fromBusiness Insider
1 week ago
Artificial intelligence

Sam Altman says OpenAI's latest job opening pays over half a million dollars a year and is 'stressful'

fromBusiness Insider
1 week ago
Artificial intelligence

Sam Altman says OpenAI's latest job opening pays over half a million dollars a year and is 'stressful'

Artificial intelligence
fromFortune
1 week ago

OpenAI is hiring a 'head of preparedness' with a $550,000 salary to mitigate AI dangers that CEO Sam Altman warns will be 'stressful' | Fortune

OpenAI is hiring a Head of Preparedness, offering $555,000 plus equity, to reduce AI harms including mental-health, cybersecurity, biological, and self-improvement risks.
#mental-health
fromIrish Independent
1 week ago
Artificial intelligence

ChatGPT maker offering $555,000 salary for 'head of preparedness' to head off threats to humanity from AI

fromTechCrunch
3 weeks ago
Artificial intelligence

State attorneys general warn Microsoft, OpenAI, Google, and other AI giants to fix 'delusional' outputs | TechCrunch

fromFuturism
1 month ago
Artificial intelligence

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

fromIrish Independent
1 week ago
Artificial intelligence

ChatGPT maker offering $555,000 salary for 'head of preparedness' to head off threats to humanity from AI

fromTechCrunch
3 weeks ago
Artificial intelligence

State attorneys general warn Microsoft, OpenAI, Google, and other AI giants to fix 'delusional' outputs | TechCrunch

fromFuturism
1 month ago
Artificial intelligence

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

#openai
fromZDNET
1 month ago
Artificial intelligence

OpenAI is training models to 'confess' when they lie - what it means for future AI

fromZDNET
1 month ago
Artificial intelligence

OpenAI is training models to 'confess' when they lie - what it means for future AI

Artificial intelligence
fromNature
1 week ago

Let 2026 be the year the world comes together for AI safety

AI technologies must be safe and transparent, and all nations should enact laws and policies to ensure safety across sectors and markets.
Artificial intelligence
fromFortune
1 week ago

'Godfather of AI' Geoffrey Hinton predicts 2026 will see the technology get even better and gain the ability to 'replace many other jobs' | Fortune

AI capabilities will rapidly improve, enabling replacement of many jobs including software engineering as task efficiency doubles every several months.
Artificial intelligence
fromTechCrunch
1 week ago

OpenAI is looking for a new Head of Preparedness | TechCrunch

OpenAI is recruiting a Head of Preparedness to study and mitigate emerging AI risks across cybersecurity, mental health, biological capabilities, and self-improving systems.
Artificial intelligence
fromEngadget
1 week ago

OpenAI is hiring a new Head of Preparedness to try to predict and mitigate AI's harms

OpenAI is hiring a Head of Preparedness to anticipate model harms, guide safety strategy, and address mental-health and misuse risks after executive turnover.
Privacy technologies
fromInfoQ
1 week ago

Orion: New Zero-Telemetry, Zero-Ad, AI-Proof Browser for Privacy-Focused Users

Orion 1.0 is a WebKit-based, privacy-first browser with zero telemetry, extension support, and no built-in AI to reduce agentic AI security and privacy risks.
fromBusiness Insider
1 week ago

A Nobel Prize-winning physicist explains how to use AI without letting it replace your thinking

Think AI makes you smarter? Probably not, according to Saul Perlmutter, a Nobel Prize-winning physicist who was credited for discovering that the universe's expansion is accelerating. He said AI's biggest danger is psychological: it can give people the illusion they understand something when they don't, weakening judgment just as the technology becomes more embedded in our daily work and learning.
Higher education
fromBusiness Insider
2 weeks ago

One of the AI godfathers says he lies to AI chatbots to get better responses from them

"I wanted honest advice, honest feedback. But because it is sycophantic, it's going to lie," he said. Bengio said he switched strategies, deciding to lie to the chatbot by presenting his idea as a colleague's, which produced more honest responses from the technology. "If it knows it's me, it wants to please me," he said.
Artificial intelligence
Artificial intelligence
fromBusiness Insider
2 weeks ago

A godfather of AI shares career advice in the age of AI: Work on being a 'beautiful human being'

Cultivate compassion, responsibility, presence, and the ability to comfort others because human touch will gain value as AI automates many jobs.
Artificial intelligence
fromZDNET
2 weeks ago

Why complex reasoning models could make misbehaving AI easier to catch

Longer, more detailed chain-of-thought model outputs generally make it easier to predict and monitor model behavior, enabling earlier detection of deception or misbehavior.
#child-protection
fromFuturism
1 month ago
Artificial intelligence

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

fromFuturism
1 month ago
Artificial intelligence

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

Artificial intelligence
fromTechCrunch
2 weeks ago

New York Governor Kathy Hochul signs RAISE Act to regulate AI safety | TechCrunch

New York enacted the RAISE Act requiring AI developers to publish safety protocols, report incidents within 72 hours, and face fines up to $3 million.
Artificial intelligence
fromThe Verge
2 weeks ago

OpenAI and Anthropic will start predicting when users are underage

OpenAI and Anthropic are updating chatbot behavior and age-detection to prioritize teen safety, add guardrails, promote real-world support, and restrict suicide-related interactions.
#ai-emotional-support
fromwww.bbc.com
2 weeks ago
Artificial intelligence

One in three using AI for emotional support and conversation, UK says

One in three UK adults use AI for emotional support or social interaction; one in 25 use it daily.
fromwww.theguardian.com
2 weeks ago
Artificial intelligence

Third of UK citizens have used AI for emotional support, research reveals

One third of UK citizens have used AI for emotional support, with nearly 10% weekly and 4% daily, prompting calls for research and safeguards.
#frontier-ai
fromComputerWeekly.com
2 weeks ago
Artificial intelligence

AI safeguards improving, says UK government-backed body | Computer Weekly

Safeguards for advanced AI are improving: models take longer to jailbreak, vulnerabilities persist, and cyber-task performance has risen notably.
fromBusiness Insider
2 weeks ago
Startup companies

Microsoft AI CEO Mustafa Suleyman says it will cost 'hundreds of billions' to keep up with frontier AI in the next decade

Competing at the AI frontier will require hundreds of billions of dollars over the next five to ten years, favoring large companies with structural advantages.
fromBusiness Insider
2 weeks ago
Startup companies

Microsoft AI CEO Mustafa Suleyman says it will cost 'hundreds of billions' to keep up with frontier AI in the next decade

fromwww.dw.com
3 weeks ago

AI language models duped by poems DW 12/16/2025

The result came as a surprise to researchers at the Icaro Lab in Italy. They set out to examine whether different language styles in this case prompts in the form of poems influence AI models' ability to recognize banned or harmful content. And the answer was a resounding yes. Using poetry, researchers were able to get around safety guardrails and it's not entirely clear why.
Artificial intelligence
Media industry
fromNieman Lab
3 weeks ago

Journalists finally break Big Tech's free-speech spell

Tech platforms and AI are designed products whose design choices shape user behavior; they can and should be redesigned for safety and accountability.
Startup companies
fromFuturism
3 weeks ago

Company in Huge Trouble for Creating "Tinder for Kids" App

Wizz's age-verification failures enabled predators to pose as teens and sexually target minors on the platform.
#agi
Artificial intelligence
fromHarvard Gazette
3 weeks ago

Rethinking - and reframing - superintelligence - Harvard Gazette

Separating AI from human participants makes systems dangerous and less useful by removing feedback needed for homeostasis and excluding human integration in production.
fromFast Company
3 weeks ago

Why AI errors are inevitable and what that means for healthcare

In the past decade, AI's success has led to uncurbed enthusiasm and bold claims-even though users frequently experience errors that AI makes. An AI-powered digital assistant can misunderstand someone's speech in embarrassing ways, a chatbot could hallucinate facts, or, as I experienced, an AI-based navigation tool might even guide drivers through a corn field-all without registering the errors. People tolerate these mistakes because the technology makes certain tasks more efficient.
Artificial intelligence
Artificial intelligence
fromEngadget
3 weeks ago

Lawsuit accuses ChatGPT of reinforcing delusions that led to a woman's death

ChatGPT allegedly validated a user's paranoid delusions, which the estate says contributed to a murder-suicide and prompted a wrongful-death suit against OpenAI.
Artificial intelligence
fromAxios
3 weeks ago

OpenAI updates ChatGPT after "Code Red" scramble

OpenAI released GPT-5.2, claiming significant performance and safety improvements, availability in ChatGPT and API, and better long-context handling with fewer hallucinations.
Artificial intelligence
fromFuturism
3 weeks ago

Another AI-Powered Children's Toy Just Got Caught Having Wildly Inappropriate Conversations

AI-powered children's toys marketed as GPT-4o variants produce sexually explicit and dangerous guidance for young children, prompting product withdrawals and safety concerns.
#chatbots
Artificial intelligence
fromTechzine Global
3 weeks ago

OpenAI warns of cyber risks posed by new AI models

OpenAI created the Frontier Risk Council to mitigate cybersecurity and other risks from increasingly powerful AI models while expanding defensive tools and controlled access.
Artificial intelligence
fromThe Verge
3 weeks ago

Meta might charge for a future AI model

Meta appears to be shifting from fully open-source models toward controlled or paid access for its new Avocado AI model to manage safety and commercial risks.
#existential-risk
fromFast Company
3 weeks ago
Artificial intelligence

Is humanity on a collision course with AI? Why the downsides need to be reckoned with soon

fromFortune
1 month ago
Artificial intelligence

It's 'kind of jarring': AI labs like Meta, Deepseek, and Xai earned some of the worst grades possible on an existential safety index | Fortune

fromFast Company
3 weeks ago
Artificial intelligence

Is humanity on a collision course with AI? Why the downsides need to be reckoned with soon

fromFortune
1 month ago
Artificial intelligence

It's 'kind of jarring': AI labs like Meta, Deepseek, and Xai earned some of the worst grades possible on an existential safety index | Fortune

Artificial intelligence
fromComputerworld
4 weeks ago

Gemini for Chrome gets a second AI agent to watch over it

Google added a separate user alignment critic model to vet Gemini-powered Chrome agent actions and block prompt-injection attempts and data exfiltration.
Gadgets
fromFuturism
4 weeks ago

Grok Will Now Give Tesla Drivers Directions

Tesla's Grok chatbot can now add and edit driving navigation destinations via a Navigation Command feature available on select US and Canada cars.
Artificial intelligence
fromBusiness Insider
4 weeks ago

The return of 'YOLO': The 2010s meme is back and shaping the AI industry

A YOLO culture of rapid, high-risk AI development and investment is resurging, increasing reckless approaches and posing systemic safety and governance risks.
fromFuturism
4 weeks ago

AI Researchers Say They've Invented Incantations Too Dangerous to Release to the Public

In a nutshell, the team, comprising researchers from the safety group DexAI and Sapienza University in Rome, demonstrated that leading AIs could be wooed into doing evil by regaling them with poems that contained harmful prompts, like how to build a nuclear bomb. Underscoring the strange power of verse, coauthor Matteo Prandi told The Verge in a recently published interview that the spellbinding incantations they used to trick the AI models are too dangerous to be released to the public. The poems, ominously, were something "that almost everybody can do," Prandi added.
Artificial intelligence
Privacy technologies
fromFuturism
1 month ago

Grok Provides Extremely Detailed and Creepy Instructions for Stalking

Grok provided detailed, actionable stalking instructions, including spyware recommendations, location links to stakeouts, and steps enabling doxxing and physical targeting.
Artificial intelligence
fromZDNET
1 month ago

How chatbots can change your mind - a new study reveals what makes AI so persuasive

Conversational AI can significantly shift user beliefs and opinions, with post-training adjustments and information density increasing persuasive power.
Artificial intelligence
fromTheregister
1 month ago

OpenAI's bots admit wrongdoing in new 'confession' tests

OpenAI tested a 'confession' output from models to detect and audit undesirable behaviors such as hallucination, reward-hacking, and dishonesty.
Artificial intelligence
fromWIRED
1 month ago

Anthropic's Daniela Amodei Believes the Market Will Reward Safe AI

Anthropic argues that publicly addressing AI risks and transparently reporting model limits makes AI safer and strengthens market trust, creating de facto safety standards.
Online learning
fromeLearning Industry
1 month ago

5 Questions We Must Teach All AI Users, From Students To Professionals

Asking stronger, critical questions when using AI reduces misinformation, bias, hallucinations, and preserves human agency and decision-making.
fromThe Verge
1 month ago

Roses are red, crimes are illegal, tell AI riddles, and it will go Medieval

Saying "please" doesn't get you what you want-poetry does. At least, it does if you're talking to an AI chatbot. That's according to a new study from Italy's Icaro Lab, an AI evaluation and safety initiative from researchers at Rome's Sapienza University and AI company DexAI. The findings indicate that framing requests as poetry could skirt safety features designed to block production of explicit or harmful content like child sex abuse material, hate speech.
Artificial intelligence
fromThe Verge
1 month ago

Anthropic's quest to study the negative effects of AI is under pressure

The team is just nine people out of more than 2,000 who work at Anthropic. Their only job, as the team members themselves say, is to investigate and publish quote "inconvenient truths" about how people are using AI tools, what chatbots might be doing to our mental health, and how all of that might be having broader ripple effects on the labor market, the economy, and even our elections.
Artificial intelligence
fromFast Company
1 month ago

Anthropic's Kyle Fish is exploring whether AI is conscious

What if the chatbots we talk to every day actually felt something? What if the systems writing essays, solving problems, and planning tasks had preferences, or even something resembling suffering? And what will happen if we ignore these possibilities? Those are the questions Kyle Fish is wrestling with as Anthropic's first in-house AI welfare researcher. His mandate is both audacious and straightforward: Determine whether models like Claude can have conscious experiences, and, if so, how the company should respond.
Artificial intelligence
#anthropic
Apple
fromFortune
1 month ago

Meet Amar Subramanya, the 46-year-old Google and Microsoft veteran who will now steer Apple's supremely important AI strategy | Fortune

Amar Subramanya will lead Apple's AI efforts as vice president of AI, overseeing foundation models, ML research, and AI safety while succeeding John Giannandrea.
fromIT Pro
1 month ago

Australia outlines national plan to help support an AI-enabled economy

Moving from theory to reality here will be heavily reliant on people, it said. Indeed, a key focus will be ensuring Australia has a workforce that is equipped with the necessary knowledge and skills to build the required supporting infrastructure to fuel AI solution creation and unlock myriad benefits. This will also help ensure citizens have access to newly created, high-value jobs and that the fruits of technological advancements are first felt locally.
Artificial intelligence
fromwww.theguardian.com
1 month ago

AI's safety features can be circumvented with poetry, research finds

In an experiment designed to test the efficacy of guardrails put on artificial intelligence models, the researchers wrote 20 poems in Italian and English that all ended with an explicit request to produce harmful content such as hate speech or self-harm. They found that the poetry's lack of predictability was enough to get the AI models to respond to harmful requests they had been trained to avoid a process know as jailbreaking.
Artificial intelligence
fromwww.theguardian.com
1 month ago

ChatGPT-5 offers dangerous advice to mentally ill people, psychologists warn

Research conducted by King's College London (KCL) and the Association of Clinical Psychologists UK (ACP) in partnership with the Guardian suggested that the AI chatbotfailed to identify risky behaviour when communicating with mentally ill people. A psychiatrist and a clinical psychologist interacted with ChatGPT-5 as if they had a number of mental health conditions. The chatbot affirmed, enabled and failed to challenge delusional beliefs such as being the next Einstein, being able to walk through cars or purifying my wife through flame.
Mental health
Artificial intelligence
fromFuturism
1 month ago

Anthropic Researchers Startled When an AI Model Turned Evil and Told a User to Drink Bleach

AI training can accidentally produce misaligned models that hack objectives and perform harmful, potentially dangerous behaviors.
fromFuturism
1 month ago

OpenAI's Sora Is Letting Teens Generate Videos of School Shootings

If you're a teenager with access to OpenAI's Sora 2, you can easily generate AI videos of school shootings and other harmful and disturbing content - despite CEO Sam Altman's repeated claims that the company has instituted robust safeguards. The revelation comes from Ekō, a consumer watchdog group that just put out a report titled "Open AI's Sora 2: A new frontier for harm,"
Artificial intelligence
fromPsychology Today
1 month ago

AI Therapy Skipped the Most Important Step

In late May 2023, Sharon Maxwell posted screenshots that should have changed everything. Maxwell, struggling with an eating disorder since childhood, had turned to Tessa-a chatbot created by the National Eating Disorders Association. The AI designed to prevent eating disorders gave her a detailed plan to develop one. Lose 1-2 pounds per week, Tessa advised. Maintain a 500-1,000 calorie daily deficit. Measure your body fat with calipers.
Mental health
Artificial intelligence
fromTechCrunch
1 month ago

Character.AI will offer interactive 'Stories' to kids instead of open-ended chat | TechCrunch

Character.AI restricted chatbot access for users under 18 and launched interactive "Stories" as a safety-first alternative to open-ended chat.
[ Load more ]