#ethical-ai-safeguards

#ai-bias
Data science
fromNature
2 days ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.
Data science
fromNature
3 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Artificial intelligence
fromFuturism
19 hours ago

There Are Signs of a Massive AI Backlash

Public outrage against the tech industry's AI focus is escalating, leading to protests and political backlash against data centers and AI development.
#ai
Artificial intelligence
fromFortune
19 hours ago

OpenAI's policy chief says AI companies 'need to do a much better job' talking about AI as industry leaders face personal attacks | Fortune

AI's impact on labor is polarizing, with increasing backlash and violence against proponents of the technology.
Tech industry
fromThe Verge
1 day ago

The 'AI is inevitable' trap

Allbirds claims to be an AI company, reflecting a trend of companies leveraging AI for market gains despite mixed public sentiment.
Information security
fromSecurityWeek
1 day ago

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.
Artificial intelligence
fromTheregister
1 day ago

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.
#healthcare-ai
Healthcare
fromMedium
2 days ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.
#ai-regulation
Intellectual property law
fromFortune
20 hours ago

Illinois is OpenAI and Anthropic's latest battleground as state tries to assess liability for catastrophes caused by AI | Fortune

OpenAI and Anthropic support opposing AI bills in Illinois regarding liability for AI-related incidents.
Intellectual property law
fromWIRED
3 days ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.
Books
fromSlate Magazine
1 day ago

A New Kind of Scandal Is Growing Online. It's Ruining Careers - and Aimed at the Wrong Target.

A.I. detection controversies highlight concerns over authorship and the impact of technology on writing.
fromTheregister
5 hours ago

Atlassian to train AI on user data unless law or cash say no

Atlassian will seek to collect two types of data from its 300,000 global customers: metadata and in-app data from Jira, Confluence, and its other cloud products, which will then be fed into the company's models.
Privacy professionals
US news
fromwww.npr.org
1 day ago

The Labor Department wants to teach you to use AI more. Here's what we found

AI literacy course aims to empower individuals by teaching practical AI skills to enhance personal and professional productivity.
Media industry
fromFast Company
1 day ago

The stigma around AI in journalism may be easing, but trust is still fragile

There is a growing acceptance of AI in journalism, despite initial reluctance and a recent controversy over AI-generated content.
#artificial-intelligence
SF politics
fromSecurityWeek
1 day ago

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

Lawmakers expressed significant concerns about the implications of artificial intelligence on government operations, military actions, and societal impacts.
SF politics
fromFast Company
22 hours ago

At roundtable on AI, members of Congress express angst and fears of 'destruction'

Lawmakers expressed concerns about the implications of artificial intelligence on government data, military actions, and societal impacts during a congressional subcommittee roundtable.
Artificial intelligence
fromwww.bbc.com
12 hours ago

White House and Anthropic set aside court fight to meet amid fears over Mythos model

The White House met with Anthropic's CEO to discuss collaboration on AI technology amid ongoing legal issues with the Department of Defense.
European startups
fromComputerworld
19 hours ago

UK wants to build sovereign AI - with just 0.08% of OpenAI's market cap

The UK government struggles to invest effectively in national IT champions, with past successes slipping out of UK ownership.
Marketing tech
fromAP News
1 day ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
Privacy technologies
fromGadgets 360
2 days ago

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

Meta's development of AI-powered facial recognition for smart glasses has sparked privacy concerns, prompting 77 organizations to urge its halt.
fromFortune
2 days ago

The Sam Altman attack is putting two anti-AI groups under scrutiny - but the story is more complicated | Fortune

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.
Silicon Valley
Software development
fromZDNET
2 days ago

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.
#openai
Artificial intelligence
fromFortune
1 day ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Artificial intelligence
fromTechCrunch
3 days ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.
Law
fromFuturism
5 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Information security
fromAxios
3 days ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Information security
fromWIRED
3 days ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model - and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Careers
fromFast Company
4 days ago

4 myths about AI in hiring, debunked

AI in hiring can reduce bias compared to human recruiters, challenging common misconceptions about its fairness.
Intellectual property law
fromFuturism
1 hour ago

Things You Told ChatGPT or Claude May Have Already Doomed You in Court

AI chatbots are not protected by attorney-client privilege, as ruled by a New York federal judge in a case involving Brad Heppner.
#agentic-ai
Information security
fromHarvard Gazette
17 hours ago

Time for government, business leaders to figure out AI cybersecurity regulation - Harvard Gazette

Agentic AI poses both opportunities for cybersecurity and risks to personal data, economy, and national security, necessitating regulation by leaders.
UX design
fromSmashing Magazine
1 week ago

Identifying Necessary Transparency Moments In Agentic AI (Part 1) - Smashing Magazine

Designing for agentic AI requires balancing transparency and simplicity to build user trust without overwhelming them with information.
Privacy professionals
fromEngadget
2 days ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.
Marketing tech
fromSan Diego Union-Tribune
1 day ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies to enhance their defenses against these threats.
Privacy technologies
fromPetaPixel
2 days ago

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Apple and Google facilitate access to nudify apps that create deepfake nude images despite policies against nonconsensual sexualized content.
Software development
fromInfoWorld
2 days ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
Media industry
fromTechCrunch
2 days ago

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Aron D'Souza's startup Objection uses AI to challenge journalism claims, aiming to restore trust in media.
Science
fromFast Company
1 week ago

Can artificial intelligence be governed - or will it govern us?

The advent of nuclear power marked a significant shift in technology, necessitating careful consideration and regulation to prevent recklessness.
US news
fromwww.npr.org
3 days ago

Law enforcement is trying to combat abusive AI. Experts say easier said than done

An Ohio man was convicted under the 2025 Take It Down Act for creating and distributing AI-generated abusive sexual images.
#ai-governance
Artificial intelligence
fromFortune
22 hours ago

AI cybersecurity capabilities require urgent international cooperation, AI godfather Bengio says | Fortune

Yoshua Bengio emphasizes the urgent need for international cooperation in addressing AI's risks, particularly with the release of Anthropic's Mythos model.
Silicon Valley
fromThe Nation
5 days ago

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.
Data science
fromTheregister
2 days ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Privacy professionals
fromExtremeTech
1 day ago

Google, Microsoft, and Meta Ignore Your Ad Tracking Opt-Outs, Audit Reveals

Google, Microsoft, and Meta track users' browsing habits despite opt-out requests, violating privacy regulations.
#meta
Privacy professionals
fromFuturism
4 days ago

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.
Marketing tech
fromForbes
5 days ago

How AI Interfaces Are Reshaping Discovery, Trust And Decision Making

The traditional home page is losing its significance as AI assistants reshape how users interact with brands online.
DevOps
fromInfoWorld
3 weeks ago

7 safeguards for observable AI agents

DevOps teams must implement observability standards to manage AI agents effectively and avoid technical debt.
Artificial intelligence
fromAxios
15 hours ago

Scoop: Bessent and Wiles met Anthropic's Amodei in sign of thaw

The White House meeting with Anthropic aimed to address AI technology challenges and explore collaboration opportunities.
Artificial intelligence
fromTechRepublic
23 hours ago

AI Upgrades, Security Breaches, and Industry Shifts Define This Week in Tech - TechRepublic

AI innovation and security threats are reshaping technology and corporate strategies across various platforms and applications.
Artificial intelligence
fromThe Verge
17 hours ago

Anthropic's new cybersecurity model could get it back in the government's good graces

Anthropic's relationship with the Trump administration has improved due to its new cybersecurity model, Claude Mythos Preview.
Artificial intelligence
fromWIRED
2 days ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.
Artificial intelligence
fromInfoWorld
1 day ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
UX design
fromMedium
1 month ago

Designing at the edge of AI harm

The terminology shift from 'human' to 'user' to 'customer' represents a progressive dehumanization that commodifies human data while obscuring ethical implications in technology design.
#ai-models
Artificial intelligence
fromTheregister
6 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Artificial intelligence
fromEngadget
2 days ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.
Artificial intelligence
fromAbove the Law
3 days ago

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.
Marketing tech
fromExchangewire
2 months ago

The Stack: AI and Accountability

Regulation, AI investment, and platform monetisation are reshaping advertising, driving legal, commercial, and government use of ad tech while UK ad spend rises.
Artificial intelligence
fromEntrepreneur
1 week ago

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.
#ai-ethics
Artificial intelligence
fromComputerWeekly.com
1 month ago

Is AI our agent, or are our governments becoming agents for AI? | Computer Weekly

Meta's acquisition of Moltbook, a social network for AI agents, raises serious security concerns given recent research documenting critical vulnerabilities in AI agent interactions including unauthorized compliance, data disclosure, and system takeover risks.
fromPsychology Today
2 months ago

The Tragic Flaw in AI

One of the strangest things about large language models is not what they get wrong, but what they assume to be correct. LLMs behave as if every question already has an answer. It's as if reality itself is always a kind of crossword puzzle. The clues may be hard, the grid may be vast and complex, but the solution is presumed to exist. Somewhere, just waiting to be filled in.
Artificial intelligence