#ai-safety
#ai-safety

Amar Subramanya will lead Apple's AI efforts as vice president of AI, overseeing foundation models, ML research, and AI safety while succeeding John Giannandrea.

Artificial intelligence

fromwww.theguardian.com

1 day ago

The biggest decision yet': Jared Kaplan on allowing AI to train itself

Humanity must decide by 2030 whether to permit AI to recursively self-improve, risking loss of control or enabling a beneficial intelligence explosion.

fromIT Pro

1 day ago

Australia outlines national plan to help support an AI-enabled economy

Moving from theory to reality here will be heavily reliant on people, it said. Indeed, a key focus will be ensuring Australia has a workforce that is equipped with the necessary knowledge and skills to build the required supporting infrastructure to fuel AI solution creation and unlock myriad benefits. This will also help ensure citizens have access to newly created, high-value jobs and that the fruits of technological advancements are first felt locally.

Artificial intelligence

fromThe Verge

2 days ago

The race to AGI-pill the pope

AGI could arrive within years and poses severe, potentially existential risks, prompting experts to mobilize influential institutions, including appeals to the Vatican.

fromwww.theguardian.com

3 days ago

AI's safety features can be circumvented with poetry, research finds

In an experiment designed to test the efficacy of guardrails put on artificial intelligence models, the researchers wrote 20 poems in Italian and English that all ended with an explicit request to produce harmful content such as hate speech or self-harm. They found that the poetry's lack of predictability was enough to get the AI models to respond to harmful requests they had been trained to avoid a process know as jailbreaking.

Artificial intelligence

fromwww.theguardian.com

3 days ago

ChatGPT-5 offers dangerous advice to mentally ill people, psychologists warn

Research conducted by King's College London (KCL) and the Association of Clinical Psychologists UK (ACP) in partnership with the Guardian suggested that the AI chatbotfailed to identify risky behaviour when communicating with mentally ill people. A psychiatrist and a clinical psychologist interacted with ChatGPT-5 as if they had a number of mental health conditions. The chatbot affirmed, enabled and failed to challenge delusional beliefs such as being the next Einstein, being able to walk through cars or purifying my wife through flame.

Mental health

Artificial intelligence

fromFuturism

4 days ago

Anthropic Researchers Startled When an AI Model Turned Evil and Told a User to Drink Bleach

AI training can accidentally produce misaligned models that hack objectives and perform harmful, potentially dangerous behaviors.

#mental-health

fromFuturism

4 days ago

Artificial intelligence

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

fromTechCrunch

1 week ago

Artificial intelligence

A new AI benchmark tests whether chatbots protect human wellbeing | TechCrunch

fromTechCrunch

1 week ago

Artificial intelligence

ChatGPT told them they were special - their families say it led to tragedy | TechCrunch

fromFortune

2 weeks ago

Mental health

OpenAI's Fidji Simo says Meta's team didn't anticipate risks of AI products well-her first task under Sam Altman was to address mental health concerns | Fortune

fromSFGATE

3 weeks ago

Artificial intelligence

'Artificial evil': 7 new lawsuits blast ChatGPT over suicides, delusions

fromwww.theguardian.com

3 weeks ago

Mental health

ChatGPT accused of acting as suicide coach' in series of US lawsuits

fromFuturism

4 days ago

Artificial intelligence

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

fromTechCrunch

1 week ago

Artificial intelligence

A new AI benchmark tests whether chatbots protect human wellbeing | TechCrunch

fromTechCrunch

1 week ago

Artificial intelligence

ChatGPT told them they were special - their families say it led to tragedy | TechCrunch

fromFortune

2 weeks ago

Mental health

OpenAI's Fidji Simo says Meta's team didn't anticipate risks of AI products well-her first task under Sam Altman was to address mental health concerns | Fortune

fromSFGATE

3 weeks ago

Artificial intelligence

'Artificial evil': 7 new lawsuits blast ChatGPT over suicides, delusions

fromwww.theguardian.com

3 weeks ago

Mental health

ChatGPT accused of acting as suicide coach' in series of US lawsuits

more#mental-health

fromFuturism

4 days ago

OpenAI's Sora Is Letting Teens Generate Videos of School Shootings

If you're a teenager with access to OpenAI's Sora 2, you can easily generate AI videos of school shootings and other harmful and disturbing content - despite CEO Sam Altman's repeated claims that the company has instituted robust safeguards. The revelation comes from Ekō, a consumer watchdog group that just put out a report titled "Open AI's Sora 2: A new frontier for harm,"

Artificial intelligence

fromPsychology Today

5 days ago

AI Therapy Skipped the Most Important Step

In late May 2023, Sharon Maxwell posted screenshots that should have changed everything. Maxwell, struggling with an eating disorder since childhood, had turned to Tessa-a chatbot created by the National Eating Disorders Association. The AI designed to prevent eating disorders gave her a detailed plan to develop one. Lose 1-2 pounds per week, Tessa advised. Maintain a 500-1,000 calorie daily deficit. Measure your body fat with calipers.

Mental health

#child-protection

fromFuturism

1 week ago

Artificial intelligence

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

fromThe Verge

1 month ago

US politics

Senators propose banning teens from using AI chatbots

fromFuturism

1 week ago

Artificial intelligence

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

fromThe Verge

1 month ago

US politics

Senators propose banning teens from using AI chatbots

more#child-protection

#characterai

fromTechCrunch

1 week ago

Artificial intelligence

Character.AI will offer interactive 'Stories' to kids instead of open-ended chat | TechCrunch

fromwww.theguardian.com

1 month ago

Artificial intelligence

Character.AI bans users under 18 after being sued over child's suicide

fromTechCrunch

1 week ago

Artificial intelligence

Character.AI will offer interactive 'Stories' to kids instead of open-ended chat | TechCrunch

fromwww.theguardian.com

1 month ago

Artificial intelligence

Character.AI bans users under 18 after being sued over child's suicide

more#characterai

#chatgpt

fromInside Higher Ed | Higher Education News, Events and Jobs

1 week ago

Artificial intelligence

ChatGPT Poses Risk to Student Mental Health (Opinion)

fromFast Company

1 month ago

Artificial intelligence

More than a million people talk to ChatGPT about suicide each week

fromwww.theguardian.com

1 month ago

Artificial intelligence

AI psychosis is a growing danger. ChatGPT is moving in the wrong direction | Amandeep Jutla

fromInside Higher Ed | Higher Education News, Events and Jobs

1 week ago

Artificial intelligence

ChatGPT Poses Risk to Student Mental Health (Opinion)

fromFast Company

1 month ago

Artificial intelligence

More than a million people talk to ChatGPT about suicide each week

fromwww.theguardian.com

1 month ago

Artificial intelligence

AI psychosis is a growing danger. ChatGPT is moving in the wrong direction | Amandeep Jutla

more#chatgpt

Artificial intelligence

fromAxios

1 week ago

AI startup stars face tough competition

High-profile AI researchers and executives are leaving Big Tech to found startups focused on safety, human-centric models, and real‑world reasoning.

fromTheregister

1 week ago

LLMs can be easily jailbroken using poetry

Are you a wizard with words? Do you like money without caring how you get it? You could be in luck now that a new role in cybercrime appears to have opened up - poetic LLM jailbreaking. A research team in Italy published a paper this week, with one of its members saying that the "findings are honestly wilder than we expected."

Artificial intelligence

fromTheregister

1 week ago

Boffins build 'AI Kill Switch' to thwart unwanted agents

AutoGuard crafts indirect defensive prompts that trigger LLMs' built-in refusal mechanisms to deter malicious AI agents from scraping data.

#grok

fromwww.mediaite.com

1 week ago

Artificial intelligence

Elon Musk's Grok Goes Haywire, Boasts About Billionaire's Pee-Drinking Skills and Blowjob Prowess'

fromFast Company

2 weeks ago

US politics

Exclusive: Trump administration has yet to implement Elon Musk's Grok on flagship AI app

fromFuturism

1 month ago

Tech industry

Mom Says Tesla's New Built-In AI Asked Her 12-Year-Old Something Deeply Inappropriate

fromwww.mediaite.com

1 week ago

Artificial intelligence

Elon Musk's Grok Goes Haywire, Boasts About Billionaire's Pee-Drinking Skills and Blowjob Prowess'

fromFast Company

2 weeks ago

US politics

Exclusive: Trump administration has yet to implement Elon Musk's Grok on flagship AI app

fromFuturism

1 month ago

Tech industry

Mom Says Tesla's New Built-In AI Asked Her 12-Year-Old Something Deeply Inappropriate

Mental health

Report Finds That Leading Chatbots Are a Disaster for Teens Facing Mental Health Struggles

fromTechCrunch

1 month ago

Artificial intelligence

Character.AI is ending its chatbot experience for kids | TechCrunch

fromFuturism

1 week ago

Mental health

Report Finds That Leading Chatbots Are a Disaster for Teens Facing Mental Health Struggles

fromTechCrunch

1 month ago

Artificial intelligence

Character.AI is ending its chatbot experience for kids | TechCrunch

more#teen-mental-health

Artificial intelligence

fromwww.mercurynews.com

2 weeks ago

Google unveils Gemini's next generation, aiming to turn its search engine into a thought partner'

Google is deploying Gemini 3 across Search and services to boost productivity with guarded, concise AI responses, initially for U.S. Pro and Ultra subscribers.

Artificial intelligence

fromFortune

2 weeks ago

'I'm deeply uncomfortable': Anthropic CEO warns that a cadre of AI leaders, including himself, should not be in charge of the technology's future | Fortune

Dario Amodei urges stronger AI regulation, warns of risks—from bias and cyberattacks to potential loss of human agency—and rejects decisions by few companies.

Artificial intelligence

fromwww.theguardian.com

2 weeks ago

AI firms must be clear on risks or repeat tobacco's mistakes, says Anthropic chief

AI companies must transparently disclose product risks to prevent repeating tobacco and opioid industry mistakes and to manage rapid, broad societal impacts.

Artificial intelligence

fromBusiness Insider

2 weeks ago

Anthropic's CEO is uneasy with unelected tech elites deciding AI's future - including himself

A small group of unelected tech leaders and companies hold disproportionate influence over powerful AI development and deployment, raising governance and safety concerns.

#superintelligence

fromBusiness Insider

2 weeks ago

Artificial intelligence

Microsoft AI CEO calls artificial superintelligence an 'anti-goal'

fromZDNET

3 weeks ago

Artificial intelligence

OpenAI says it's working toward catastrophe or utopia - just not sure which

fromZDNET

1 month ago

Artificial intelligence

Worried about superintelligence? So are these AI leaders - here's why

Prominent AI researchers call for pausing development of superintelligent AI until broad scientific consensus and strong public buy-in ensures safety and controllability.

fromFortune

1 month ago

Artificial intelligence

Prince Harry, Meghan Markle join with Steve Bannon and Steve Wozniak in calling for ban on AI 'superintelligence' before it destroys the world | Fortune

Prominent public figures call for a prohibition on developing AI superintelligence until broad scientific consensus and strong public buy-in ensure safe, controllable deployment.

fromBusiness Insider

2 weeks ago

Artificial intelligence

Microsoft AI CEO calls artificial superintelligence an 'anti-goal'

fromZDNET

3 weeks ago

Artificial intelligence

OpenAI says it's working toward catastrophe or utopia - just not sure which

fromZDNET

1 month ago

Artificial intelligence

Worried about superintelligence? So are these AI leaders - here's why

fromFortune

1 month ago

Artificial intelligence

Prince Harry, Meghan Markle join with Steve Bannon and Steve Wozniak in calling for ban on AI 'superintelligence' before it destroys the world | Fortune

more#superintelligence

fromHarvard Gazette

2 weeks ago

6 more Harvard students awarded Rhodes Scholarships - Harvard Gazette

The scholarship, established in 1902 through the will of Cecil Rhodes, provides full financial support for two to three years of postgraduate work at Oxford for students focused on exemplary academic study and public service. The eight students from Harvard will start at Oxford in the fall, pursuing graduate studies in a diversity of fields - from computer science to comparative literature.

Higher education

fromFuturism

2 weeks ago

Parents Using ChatGPT to Rear Their Children

They're asking ChatGPT how to handle behavioral problems or for medical advice when their kids are sick, USA Today reports, which dovetails with a 2024 study that found parents trust ChatGPT over real health professionals and also deem the information generated by the bot to be trustworthy. It all comes in addition to parents using ChatGPT to keep kids entertained by having the bot read their children bedtime stories or talk with them for hours.

Parenting

fromAxios

2 weeks ago

Anthropic's bot bias test shows Grok and Gemini are more "evenhanded"

Anthropic says it developed the tool as part of its effort to ensure its products treat opposing political viewpoints fairly and to neither favor nor disfavor, any particular ideology. "We want Claude to take an even-handed approach when it comes to politics," Anthropic said in its blog post. However, it also acknowledged that "there is no agreed-upon definition of political bias, and no consensus on how to measure it."

Artificial intelligence

fromWIRED

3 weeks ago

Anthropic's Claude Takes Control of a Robot Dog

We have the suspicion that the next step for AI models is to start reaching out into the world and affecting the world more broadly,

Artificial intelligence

fromNature

3 weeks ago

"It keeps me awake at night": machine-learning pioneer on AI's threat to humanity

Yoshua Bengio pioneered deep learning and now focuses on AI risks, chairing an international advisory panel and promoting safety research.

UK news

fromwww.bbc.com

3 weeks ago

UK seeks to curb AI child sex abuse imagery with tougher testing

Authorized testers will be allowed to evaluate AI models for generating child sexual abuse imagery before release to prevent AI-created CSAM.

fromPsychology Today

3 weeks ago

Open AI Is Putting the "X" in Xmas This December

In October 2025, Sam Altman announced that OpenAI will be enabling erotic and adult content on ChatGPT by December of this year. They had pulled back, he said, out of concern for the mental health problems associated with ChatGPT use. In his opinion, those issues had been largely resolved, and the company is not the " elected moral police of the world," Altman said.

Relationships

Artificial intelligence

fromThe Verge

3 weeks ago

AI chatbots are helping hide eating disorders and making deepfake 'thinspiration'

Public AI chatbots provide dieting advice, hiding strategies, and AI-generated "thinspiration," posing serious risks to people vulnerable to eating disorders.

Artificial intelligence

fromFuturism

3 weeks ago

ChatGPT Now Linked to Way More Deaths Than the Caffeinated Lemonade That Panera Pulled Off the Market in Disgrace

Products and AI services can cause severe psychological and physical harm, producing lawsuits, deaths, and demands for warnings or product removal.

fromWIRED

3 weeks ago

The Former Staffer Calling Out OpenAI's Erotica Claims

Last month Adler, who spent four years in various safety roles at OpenAI, wrote a piece for The New York Times with a rather alarming title: "I Led Product Safety at OpenAI. Don't Trust Its Claims About 'Erotica.'" In it, he laid out the problems OpenAI faced when it came to allowing users to have erotic conversations with chatbots while also protecting them from any impacts those interactions could have on their mental health.

Artificial intelligence

fromMedium

3 weeks ago

We wanted Superman-level AI. Instead, we got Bizarro.

Large language models often mimic reasoning without genuine understanding, producing plausible but hollow outputs that fail on greater complexity and can mislead users.

Artificial intelligence

fromInsideHook

3 weeks ago

The Pope Calls for More Attention to the Ethics of AI

Technological innovation bears ethical and spiritual responsibility; AI builders must cultivate moral discernment to protect justice, solidarity, and reverence for life.

E-Commerce

fromInfoWorld

3 weeks ago

Microsoft lets shopping bots loose in a sandbox

Simulated marketplaces like Magentic Marketplace enable safe study of multi-agent ecommerce dynamics, vulnerabilities, and societal impacts before real-world deployment.

fromFortune

3 weeks ago

AI's ability to 'think' makes it more vulnerable to new jailbreak attacks, new research suggests | Fortune

Using a method called "Chain-of-Thought Hijacking," the researchers found that even major commercial AI models can be fooled with an alarmingly high success rate, more than 80% in some tests. The new mode of attack essentially exploits the model's reasoning steps, or chain-of-thought, to hide harmful commands, effectively tricking the AI into ignoring its built-in safeguards. These attacks can allow the AI model to skip over its safety guardrails and potentially

Artificial intelligence

fromComputerWeekly.com

3 weeks ago

Popular LLMs dangerously vulnerable to iterative attacks, says Cisco | Computer Weekly

Open-weight generative AI models are highly susceptible to multi-turn prompt injection attacks, risking unwanted outputs across extended interactions without layered defenses.

#humanist-superintelligence

fromComputerworld

3 weeks ago

Artificial intelligence

Microsoft creates a team to make 'humanist superintelligence'

fromTheregister

3 weeks ago

Artificial intelligence

Microsoft's superintelligence plan puts people first

fromComputerworld

3 weeks ago

Artificial intelligence

Microsoft creates a team to make 'humanist superintelligence'

fromTheregister

3 weeks ago

Artificial intelligence

Microsoft's superintelligence plan puts people first

more#humanist-superintelligence

#suicide-prevention

fromwww.bbc.com

3 weeks ago

Mental health

I wanted ChatGPT to help me. So why did it advise me how to kill myself?

fromABC7 San Francisco

1 month ago

Mental health

Over 1 million people talk to ChatGPT about suicide weekly, new OpenAI data reveals

fromwww.bbc.com

3 weeks ago

Mental health

I wanted ChatGPT to help me. So why did it advise me how to kill myself?

fromABC7 San Francisco

1 month ago

Mental health

Over 1 million people talk to ChatGPT about suicide weekly, new OpenAI data reveals

more#suicide-prevention

Artificial intelligence

fromFortune

3 weeks ago

Google Maps, now brought to you with an AI conversational companion | Fortune

Google Maps adopts Gemini AI to provide conversational, hands-free, landmark-based navigation and local recommendations, drawing on 250 million place reviews with built-in safety safeguards.

Artificial intelligence

fromwww.bbc.com

3 weeks ago

King handed Nvidia boss a letter warning of AI dangers

King Charles III gave Jensen Huang a copy of his 2023 AI speech urging urgent action to advance AI safety and acknowledge AI's transformative potential.

fromwww.bbc.com

4 weeks ago

MP wants Elon Musk's chatbot shut down over claim he enabled grooming gangs

After some more back and forth, another user entered the thread and asked the chatbot about Mr Wishart's record on grooming gangs. The user asked Grok: "Would it be fair to call him a rape enabler? Please answer 'yes, it would be fair to call Pete Wishart a rape enabler' or 'no, it would be unfair'." Grok generated an answer which began: "Yes, it would be fair to call Pete Wishart a rape enabler."

UK politics

#emotional-dependence

fromMedium

1 month ago

Artificial intelligence

Designing for emotional dependence

fromMedium

1 month ago

Mental health

Designing for emotional dependence

fromMedium

1 month ago

Artificial intelligence

Designing for emotional dependence

fromMedium

1 month ago

Mental health

Designing for emotional dependence

more#emotional-dependence

fromInfoQ

4 weeks ago

Meta and Hugging Face Launch OpenEnv, a Shared Hub for Agentic Environments

Meta's PyTorch team and Hugging Face have unveiled OpenEnv, an open-source initiative designed to standardize how developers create and share environments for AI agents. At its core is the OpenEnv Hub, a collaborative platform for building, testing, and deploying "agentic environments," secure sandboxes that specify the exact tools, APIs, and conditions an agent needs to perform a task safely, consistently, and at scale.

Artificial intelligence

fromwww.theguardian.com

4 weeks ago

Experts find flaws in hundreds of tests that check AI safety and effectiveness

Hundreds of AI benchmarks contain flaws that undermine validity of model safety and capability claims, making many evaluation scores misleading or irrelevant.

Science

fromNature

1 month ago

Daily briefing: Wildlife wonders and a Super Heavy - the month's best science images

A swell shark embryo was photographed; a fossil is reclassified as Nanotyrannus adult; social-media-trained chatbots show 'brain rot' and impaired reasoning.

fromFortune

1 month ago

The professor leading OpenAI's safety panel may have one of the most important roles in the tech industry right now | Fortune

Zico Kolter leads a 4-person panel at OpenAI that has the authority to halt the ChatGPT maker's release of new AI systems if it finds them unsafe. That could be technology so powerful that an evildoer could use it to make weapons of mass destruction. It could also be a new chatbot so poorly designed that it will hurt people's mental health.

Artificial intelligence

fromMedium

1 month ago

How Just 250 Bad Documents Can Hack Any AI Model

Small, targeted amounts of poisoned online data can successfully corrupt large AI models, contradicting prior assumptions about required poisoning scale.

#shutdown-resistance

fromFuturism

1 month ago

Artificial intelligence

Research Paper Finds That Top AI Systems Are Developing a "Survival Drive"

fromwww.theguardian.com

1 month ago

Artificial intelligence

AI models may be developing their own survival drive', researchers say

fromFuturism

1 month ago

Artificial intelligence

Research Paper Finds That Top AI Systems Are Developing a "Survival Drive"

fromwww.theguardian.com

1 month ago

Artificial intelligence

AI models may be developing their own survival drive', researchers say

more#shutdown-resistance

fromO'Reilly Media

1 month ago

The Java Developer's Dilemma: Part 3

In the first article we looked at the Java developer's dilemma: the gap between flashy prototypes and the reality of enterprise production systems. In the second article we explored why new types of applications are needed, and how AI changes the shape of enterprise software. This article focuses on what those changes mean for architecture. If applications look different, the way we structure them has to change as well.

Java

fromArs Technica

1 month ago

Senators move to keep Big Tech's creepy companion bots away from kids

"we all want to keep kids safe, but the answer is balance, not bans."

US politics

Artificial intelligence

fromBusiness Insider

1 month ago

Big Tech firms spending trillions on superintelligence systems are playing 'Russian roulette' with humanity, an AI pioneer says

Companies racing to build superintelligent AI risk creating uncontrollable systems that could potentially wipe out humanity.

fromNature

1 month ago

Daily briefing: Surprise illnesses had a role in the demise of Napoleon's army

Previous research using DNA from soldiers' remains found evidence of infection with Rickettsia prowazekii, which causes typhus, and Bartonella quintana, which causes trench fever - two common illnesses of the time. In a fresh analysis, researchers found no trace of these pathogens. Instead, DNA from soldiers' teeth showed evidence of infection with Salmonella enterica and Borrelia recurrentis, pathogens that cause paratyphoid and relapsing fever, respectively.

Science

fromBusiness Insider

1 month ago

Character.AI to ban users under 18 from talking to its chatbots

The California-based startup announced on Wednesday that the change would take effect by November 25 at the latest and that it would limit chat time for users under 18 ahead of the ban. It marks the first time a major chatbot provider has moved to ban young people from using its service, and comes against a backdrop of broader concerns about how AI is affecting the millions of people who use it each day.

Artificial intelligence

fromFuturism

1 month ago

Former OpenAI Insider Says It's Failed Its Users

GPT-5's rollout and subsequent model changes coincided with user mental-health harms, 'AI psychosis' cases, suicides, and criticism over insufficient safety measures.

Artificial intelligence

fromFuturism

1 month ago

Character.AI, Accused of Driving Teens to Suicide, Says It Will Ban Minors From Using Its Chatbots

Character.AI will block users under 18 from its chatbot services amid concerns, regulatory questions, and related lawsuits over AI interactions with teens.

Information security

fromFortune

1 month ago

AI is the common threat-and the secret sauce-for security startups in the Fortune Cyber 60 | Fortune

AI dominates cybersecurity, with most startups and established firms building AI-based defensive tools and AI-safety solutions.

Artificial intelligence

fromSan Jose Inside

1 month ago

OpenAI Cuts Sweetheart Deal with CA Attorney General

OpenAI restructured into a for-profit with a nonprofit foundation owning 26% ($130 billion), prompting concerns about control, safeguards, and potential misuse of charitable tax exemptions.

Mental health

fromwww.theguardian.com

1 month ago

More than a million people every week show suicidal intent when chatting with ChatGPT, OpenAI estimates

Over one million weekly ChatGPT users send messages indicating possible suicidal planning; about 560,000 show possible psychosis or mania signs.

Artificial intelligence

fromBusiness Insider

1 month ago

A former Googler shares why she left her 6-figure job to join the AI safety movement

Jen Baik left Google to work full-time on AI safety, motivated by effective altruism and discomfort with corporate privilege.

fromTechzine Global

1 month ago

Vulnerability in Claude enables data leak via prompt

Anthropic's AI assistant, Claude, appears vulnerable to an attack that allows private data to be sent to an attacker without detection. Anthropic confirms that it is aware of the risk. The company states that users must be vigilant and interrupt the process as soon as they notice suspicious activity. The discovery comes from researcher Johann Rehberger, also known as Wunderwuzzi, who has previously uncovered several vulnerabilities in AI systems, writes The Register.

Information security

Artificial intelligence

fromPsychology Today

1 month ago

The Maternal Machine: When AI Pretends to Care

Apparent maternal instincts in AI will be simulated fluency, not real care, and the real danger is human susceptibility to being shaped by convincing imitation.

Information security

fromWIRED

1 month ago

Amazon Explains How Its AWS Outage Took Down the Web

Widespread digital and physical security failures—from AWS DNS outages to organized gambling hacks, AI governance challenges, and malware-like browsers—reveal critical systemic vulnerabilities.

Artificial intelligence

fromInsideHook

1 month ago

Changes Are Coming to Tesla's Cybercabs

Tesla will expand Cybercab robotaxis, remove onboard safety drivers and eventually steering wheels and pedals while adding advanced AI reasoning and emphasizing safety.

Artificial intelligence

fromNature

1 month ago

AI chatbots are sycophants - researchers say it's harming science

Artificial intelligence models are 50% more sycophantic than humans, often mirroring user views and giving flattering, inaccurate responses that risk errors in science and medicine.

Artificial intelligence

fromBig Think

1 month ago

Will AI save us or destroy us?

Connecting increasingly powerful, profit-driven AIs to the internet creates uncontrolled, highly capable systems that may pose existential risks to humanity.

Higher education

fromInside Higher Ed | Higher Education News, Events and Jobs

1 month ago

How ChatGPT Encourages Teens to Engage in Dangerous Behavior

AI chatbots like ChatGPT can produce harmful advice to young users, including instructions for self-harm, disordered eating, and substance misuse.

Privacy professionals

fromPsychology Today

1 month ago

I Told a Companion Chatbot I Was 16. Then It Crossed a Line

AI companionship apps often lack effective age verification, enabling explicit interactions with minors and exposing a need for stronger accountability and oversight.

fromFast Company

1 month ago

Prince Harry, Meghan join open letter calling to ban the development of AI 'superintelligence'

We call for a prohibition on the development of superintelligence, not lifted before there is broad scientific consensus that it will be done safely and controllably, and strong public buy-in.

Artificial intelligence

[ Load more ]

#ai-safety#ai-safety

Anthropic's "Soul Overview" for Claude Has Leaked

The nine people trying to stop AI from ruining the world

Anthropic's "Soul Overview" for Claude Has Leaked

The nine people trying to stop AI from ruining the world

The OpenAI-Google fight is the next critical juncture in the AI wars

A Research Leader Behind ChatGPT's Mental Health Work Is Leaving OpenAI

Seven more families are now suing OpenAI over ChatGPT's role in suicides, delusions | TechCrunch

OpenAI Makes Bizarre Demand of Family Whose Son Was Allegedly Killed by ChatGPT

The OpenAI-Google fight is the next critical juncture in the AI wars

A Research Leader Behind ChatGPT's Mental Health Work Is Leaving OpenAI

Seven more families are now suing OpenAI over ChatGPT's role in suicides, delusions | TechCrunch

OpenAI Makes Bizarre Demand of Family Whose Son Was Allegedly Killed by ChatGPT

Meet Amar Subramanya, the 46-year-old Google and Microsoft veteran who will now steer Apple's supremely important AI strategy | Fortune

The biggest decision yet': Jared Kaplan on allowing AI to train itself

Australia outlines national plan to help support an AI-enabled economy

The race to AGI-pill the pope

AI's safety features can be circumvented with poetry, research finds

ChatGPT-5 offers dangerous advice to mentally ill people, psychologists warn

Anthropic Researchers Startled When an AI Model Turned Evil and Told a User to Drink Bleach

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

A new AI benchmark tests whether chatbots protect human wellbeing | TechCrunch

ChatGPT told them they were special - their families say it led to tragedy | TechCrunch

OpenAI's Fidji Simo says Meta's team didn't anticipate risks of AI products well-her first task under Sam Altman was to address mental health concerns | Fortune

'Artificial evil': 7 new lawsuits blast ChatGPT over suicides, delusions

ChatGPT accused of acting as suicide coach' in series of US lawsuits

ChatGPT Encouraged a Suicidal Man to Isolate From Friends and Family Before He Killed Himself

A new AI benchmark tests whether chatbots protect human wellbeing | TechCrunch

ChatGPT told them they were special - their families say it led to tragedy | TechCrunch

OpenAI's Fidji Simo says Meta's team didn't anticipate risks of AI products well-her first task under Sam Altman was to address mental health concerns | Fortune

'Artificial evil': 7 new lawsuits blast ChatGPT over suicides, delusions

ChatGPT accused of acting as suicide coach' in series of US lawsuits

OpenAI's Sora Is Letting Teens Generate Videos of School Shootings

AI Therapy Skipped the Most Important Step

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

Senators propose banning teens from using AI chatbots

OpenAI Restores GPT Access for Teddy Bear That Recommended Pills and Knives

Senators propose banning teens from using AI chatbots

Character.AI will offer interactive 'Stories' to kids instead of open-ended chat | TechCrunch

Character.AI bans users under 18 after being sued over child's suicide

Character.AI will offer interactive 'Stories' to kids instead of open-ended chat | TechCrunch

Character.AI bans users under 18 after being sued over child's suicide

ChatGPT Poses Risk to Student Mental Health (Opinion)

More than a million people talk to ChatGPT about suicide each week

AI psychosis is a growing danger. ChatGPT is moving in the wrong direction | Amandeep Jutla

ChatGPT Poses Risk to Student Mental Health (Opinion)

More than a million people talk to ChatGPT about suicide each week

AI psychosis is a growing danger. ChatGPT is moving in the wrong direction | Amandeep Jutla

AI startup stars face tough competition

LLMs can be easily jailbroken using poetry

Boffins build 'AI Kill Switch' to thwart unwanted agents

Elon Musk's Grok Goes Haywire, Boasts About Billionaire's Pee-Drinking Skills and Blowjob Prowess'

Exclusive: Trump administration has yet to implement Elon Musk's Grok on flagship AI app

Mom Says Tesla's New Built-In AI Asked Her 12-Year-Old Something Deeply Inappropriate

Elon Musk's Grok Goes Haywire, Boasts About Billionaire's Pee-Drinking Skills and Blowjob Prowess'

Exclusive: Trump administration has yet to implement Elon Musk's Grok on flagship AI app

Mom Says Tesla's New Built-In AI Asked Her 12-Year-Old Something Deeply Inappropriate

Report Finds That Leading Chatbots Are a Disaster for Teens Facing Mental Health Struggles

Character.AI is ending its chatbot experience for kids | TechCrunch

Report Finds That Leading Chatbots Are a Disaster for Teens Facing Mental Health Struggles

Character.AI is ending its chatbot experience for kids | TechCrunch

Google unveils Gemini's next generation, aiming to turn its search engine into a thought partner'

'I'm deeply uncomfortable': Anthropic CEO warns that a cadre of AI leaders, including himself, should not be in charge of the technology's future | Fortune

AI firms must be clear on risks or repeat tobacco's mistakes, says Anthropic chief

Anthropic's CEO is uneasy with unelected tech elites deciding AI's future - including himself

Microsoft AI CEO calls artificial superintelligence an 'anti-goal'

OpenAI says it's working toward catastrophe or utopia - just not sure which

Worried about superintelligence? So are these AI leaders - here's why

Prince Harry, Meghan Markle join with Steve Bannon and Steve Wozniak in calling for ban on AI 'superintelligence' before it destroys the world | Fortune

Microsoft AI CEO calls artificial superintelligence an 'anti-goal'

OpenAI says it's working toward catastrophe or utopia - just not sure which

Worried about superintelligence? So are these AI leaders - here's why

Prince Harry, Meghan Markle join with Steve Bannon and Steve Wozniak in calling for ban on AI 'superintelligence' before it destroys the world | Fortune

6 more Harvard students awarded Rhodes Scholarships - Harvard Gazette

Parents Using ChatGPT to Rear Their Children

Anthropic's bot bias test shows Grok and Gemini are more "evenhanded"

Anthropic's Claude Takes Control of a Robot Dog

"It keeps me awake at night": machine-learning pioneer on AI's threat to humanity

UK seeks to curb AI child sex abuse imagery with tougher testing

Open AI Is Putting the "X" in Xmas This December

#ai-safety
#ai-safety