#ethical-ai-safeguards
#ethical-ai-safeguards

[ follow ]

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.

Data science

fromNature

3 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.

Data science

fromNature

2 days ago

Daily briefing: AI systems can 'teach' biases to other models

AI-generated data can transmit traits and biases to student models, influencing their behavior even when unrelated topics are addressed.

Data science

fromNature

3 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.

more#ai-bias

Artificial intelligence

fromFuturism

19 hours ago

There Are Signs of a Massive AI Backlash

Public outrage against the tech industry's AI focus is escalating, leading to protests and political backlash against data centers and AI development.

#ai

fromwww.bbc.com

23 hours ago

Information security

What is Claude Mythos and what risks does it pose?

fromThe Verge

1 day ago

Tech industry

The 'AI is inevitable' trap

Artificial intelligence

fromBusiness Matters

1 day ago

An SEO's guide to ethical AI use

AI in SEO offers speed and analytical power but raises significant ethical concerns that must be addressed for long-term success.

fromSecurityWeek

1 day ago

Information security

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

fromTheregister

1 day ago

Artificial intelligence

Make bad moves on AI and face voter backlash, govts warned

Artificial intelligence

fromFortune

19 hours ago

OpenAI's policy chief says AI companies 'need to do a much better job' talking about AI as industry leaders face personal attacks | Fortune

AI's impact on labor is polarizing, with increasing backlash and violence against proponents of the technology.

Information security

fromwww.bbc.com

23 hours ago

What is Claude Mythos and what risks does it pose?

Anthropic's Claude Mythos AI model outperforms humans in some cybersecurity tasks, raising concerns among regulators and tech companies.

Tech industry

fromThe Verge

1 day ago

The 'AI is inevitable' trap

Allbirds claims to be an AI company, reflecting a trend of companies leveraging AI for market gains despite mixed public sentiment.

Artificial intelligence

fromBusiness Matters

1 day ago

An SEO's guide to ethical AI use

AI in SEO offers speed and analytical power but raises significant ethical concerns that must be addressed for long-term success.

Information security

fromSecurityWeek

1 day ago

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.

Artificial intelligence

fromTheregister

1 day ago

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.

Artificial intelligence

fromFortune

19 hours ago

OpenAI's policy chief says AI companies 'need to do a much better job' talking about AI as industry leaders face personal attacks | Fortune

AI's impact on labor is polarizing, with increasing backlash and violence against proponents of the technology.

AI needs a reality check

Healthcare AI companies often make bold claims, but few have successfully developed treatments that work in humans.

Healthcare

fromMedium

2 days ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.

Healthcare

fromFast Company

18 hours ago

AI needs a reality check

Healthcare AI companies often make bold claims, but few have successfully developed treatments that work in humans.

Healthcare

fromMedium

2 days ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.

more#healthcare-ai

#ai-regulation

Intellectual property law

fromFortune

20 hours ago

Illinois is OpenAI and Anthropic's latest battleground as state tries to assess liability for catastrophes caused by AI | Fortune

OpenAI and Anthropic support opposing AI bills in Illinois regarding liability for AI-related incidents.

Intellectual property law

fromWIRED

3 days ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

Intellectual property law

fromFortune

20 hours ago

Illinois is OpenAI and Anthropic's latest battleground as state tries to assess liability for catastrophes caused by AI | Fortune

OpenAI and Anthropic support opposing AI bills in Illinois regarding liability for AI-related incidents.

Intellectual property law

fromWIRED

3 days ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.

A New Kind of Scandal Is Growing Online. It's Ruining Careers-and Aimed at the Wrong Target.

A.I. detection controversies highlight concerns over authorship and the impact of technology on writing.

fromTheregister

5 hours ago

Atlassian to train AI on user data unless law or cash say no

Atlassian will seek to collect two types of data from its 300,000 global customers: metadata and in-app data from Jira, Confluence, and its other cloud products, which will then be fed into the company's models.

Privacy professionals

US news

fromwww.npr.org

1 day ago

The Labor Department wants to teach you to use AI more. Here's what we found

AI literacy course aims to empower individuals by teaching practical AI skills to enhance personal and professional productivity.

Philosophy

fromPsychology Today

19 hours ago

What AI Can't Calculate About a Human Life

Human life is a singular, unrepeatable event, contrasting with AI's reliance on patterns and probabilities.

Media industry

fromFast Company

1 day ago

The stigma around AI in journalism may be easing, but trust is still fragile

There is a growing acceptance of AI in journalism, despite initial reluctance and a recent controversy over AI-generated content.

#artificial-intelligence

fromSecurityWeek

1 day ago

SF politics

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

fromFast Company

22 hours ago

SF politics

At roundtable on AI, members of Congress express angst and fears of 'destruction'

fromwww.bbc.com

12 hours ago

Artificial intelligence

White House and Anthropic set aside court fight to meet amid fears over Mythos model

Artificial intelligence

fromnews.bitcoin.com

23 hours ago

Anthropic Debuts Claude Opus 4.7 as Agentic Workflows Take Center Stage

Anthropic launched Claude Opus 4.7 on April 16, 2026, achieving an 87.6% score on the SWE-bench Verified test.

Artificial intelligence

fromPsychology Today

4 days ago

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

Artificial intelligence

fromSecurityWeek

1 week ago

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

SF politics

fromSecurityWeek

1 day ago

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

Lawmakers expressed significant concerns about the implications of artificial intelligence on government operations, military actions, and societal impacts.

SF politics

fromFast Company

22 hours ago

At roundtable on AI, members of Congress express angst and fears of 'destruction'

Lawmakers expressed concerns about the implications of artificial intelligence on government data, military actions, and societal impacts during a congressional subcommittee roundtable.

Artificial intelligence

fromwww.bbc.com

12 hours ago

White House and Anthropic set aside court fight to meet amid fears over Mythos model

The White House met with Anthropic's CEO to discuss collaboration on AI technology amid ongoing legal issues with the Department of Defense.

Artificial intelligence

fromnews.bitcoin.com

23 hours ago

Anthropic Debuts Claude Opus 4.7 as Agentic Workflows Take Center Stage

Anthropic launched Claude Opus 4.7 on April 16, 2026, achieving an 87.6% score on the SWE-bench Verified test.

Artificial intelligence

fromPsychology Today

4 days ago

The ProSocial AI Index: A Better Way to Think About AI

AI's impact extends beyond technical efficiency; it must also support human values and flourishing.

Artificial intelligence

fromSecurityWeek

1 week ago

Can we Trust AI? No - But Eventually We Must

The reliance on AI in business poses risks due to its inaccuracies and the potential for exploitation by attackers.

more#artificial-intelligence

European startups

fromComputerworld

19 hours ago

UK wants to build sovereign AI - with just 0.08% of OpenAI's market cap

The UK government struggles to invest effectively in national IT champions, with past successes slipping out of UK ownership.

Marketing tech

fromAP News

1 day ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.

Privacy technologies

fromGadgets 360

2 days ago

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

Meta's development of AI-powered facial recognition for smart glasses has sparked privacy concerns, prompting 77 organizations to urge its halt.

fromFortune

2 days ago

The Sam Altman attack is putting two anti-AI groups under scrutiny-but the story is more complicated | Fortune

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.

Silicon Valley

Software development

fromZDNET

2 days ago

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.

Online marketing

fromSearch Engine Roundtable

4 days ago

Google Warns Against Trying to Manipulate LLMs

Google is aware of self-serving listicles and actively works to combat manipulation in search results.

#openai

fromFuturism

5 days ago

Law

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

fromAxios

3 days ago

Information security

OpenAI expands access to cyber AI as hacking risks grow

fromWIRED

3 days ago

Information security

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

fromwww.businessinsider.com

13 hours ago

Artificial intelligence

OpenAI loses 3 top executives as it cuts back on 'side quests'

Artificial intelligence

fromFortune

1 day ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.

Artificial intelligence

fromTechCrunch

3 days ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.

Law

fromFuturism

5 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.

Information security

fromAxios

3 days ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.

Information security

fromWIRED

3 days ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.

Artificial intelligence

fromwww.businessinsider.com

13 hours ago

OpenAI loses 3 top executives as it cuts back on 'side quests'

OpenAI lost three top executives as it narrows focus and prepares for an IPO amid increasing competition from Anthropic.

Artificial intelligence

fromFortune

1 day ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.

Artificial intelligence

fromTechCrunch

3 days ago

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI's $852 billion valuation faces skepticism as it competes with Anthropic, which has seen significant revenue growth.

4 myths about AI in hiring, debunked

AI in hiring can reduce bias compared to human recruiters, challenging common misconceptions about its fairness.

Artificial intelligence

fromComputerWeekly.com

1 day ago

Welcome to agentic AI. Welcome to per-agent licensing | Computer Weekly

AI monetization remains a challenge despite high public awareness and competition among major tech players.

Intellectual property law

fromFuturism

1 hour ago

Things You Told ChatGPT or Claude My Have Already Doomed You in Court

AI chatbots are not protected by attorney-client privilege, as ruled by a New York federal judge in a case involving Brad Heppner.

Time for government, business leaders to figure out AI cybersecurity regulation - Harvard Gazette

Agentic AI poses both opportunities for cybersecurity and risks to personal data, economy, and national security, necessitating regulation by leaders.

UX design

fromSmashing Magazine

1 week ago

Identifying Necessary Transparency Moments In Agentic AI (Part 1) - Smashing Magazine

Designing for agentic AI requires balancing transparency and simplicity to build user trust without overwhelming them with information.

fromInfoWorld

2 months ago

Artificial intelligence

Agentic AI exposes what we're doing wrong

Information security

fromHarvard Gazette

17 hours ago

Time for government, business leaders to figure out AI cybersecurity regulation - Harvard Gazette

Agentic AI poses both opportunities for cybersecurity and risks to personal data, economy, and national security, necessitating regulation by leaders.

UX design

fromSmashing Magazine

1 week ago

Identifying Necessary Transparency Moments In Agentic AI (Part 1) - Smashing Magazine

Designing for agentic AI requires balancing transparency and simplicity to build user trust without overwhelming them with information.

fromInfoWorld

2 months ago

Artificial intelligence

Agentic AI exposes what we're doing wrong

more#agentic-ai

Privacy professionals

fromEngadget

2 days ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.

Marketing tech

fromSan Diego Union-Tribune

1 day ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies to enhance their defenses against these threats.

Privacy technologies

fromPetaPixel

2 days ago

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Apple and Google facilitate access to nudify apps that create deepfake nude images despite policies against nonconsensual sexualized content.

Software development

fromInfoWorld

2 days ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.

Media industry

fromTechCrunch

2 days ago

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Aron D'Souza's startup Objection uses AI to challenge journalism claims, aiming to restore trust in media.

Science

fromFast Company

1 week ago

Can artificial intelligence be governed-or will it govern us?

The advent of nuclear power marked a significant shift in technology, necessitating careful consideration and regulation to prevent recklessness.

US news

fromwww.npr.org

3 days ago

Law enforcement is trying to combat abusive AI. Experts say easier said than done

An Ohio man was convicted under the 2025 Take It Down Act for creating and distributing AI-generated abusive sexual images.

#ai-governance

fromBusiness Matters

6 days ago

Philosophy

The Naughty AI President: A New Age of Governance

fromFortune

22 hours ago

Artificial intelligence

AI cybersecurity capabilities require urgent international cooperation, AI godfather Bengio says | Fortune

Artificial intelligence

fromeLearning Industry

5 days ago

Custom AI Governance Services: The Missing Piece In Your L&D Strategy

Many L&D teams adopt AI tools without ensuring fairness, transparency, and accountability in their training programs.

Artificial intelligence

fromMarTech

2 weeks ago

Your AI governance gap is bigger than you think | MarTech

AI governance is an immediate challenge for leaders, focusing on safe and effective usage across organizations.

Artificial intelligence

fromTNW | Artificial-Intelligence

4 weeks ago

AI analytics agents need guardrails, not more model size

Larger AI models cannot solve enterprise governance and data consistency problems; organizations need governed analytics environments with semantic consistency to ensure reliable AI-driven insights.

Philosophy

fromBusiness Matters

6 days ago

The Naughty AI President: A New Age of Governance

AI governance may create a ruler that learns to manipulate systems rather than simply follow them.

Artificial intelligence

fromFortune

22 hours ago

AI cybersecurity capabilities require urgent international cooperation, AI godfather Bengio says | Fortune

Yoshua Bengio emphasizes the urgent need for international cooperation in addressing AI's risks, particularly with the release of Anthropic's Mythos model.

Artificial intelligence

fromeLearning Industry

5 days ago

Custom AI Governance Services: The Missing Piece In Your L&D Strategy

Many L&D teams adopt AI tools without ensuring fairness, transparency, and accountability in their training programs.

Artificial intelligence

fromMarTech

2 weeks ago

Your AI governance gap is bigger than you think | MarTech

AI governance is an immediate challenge for leaders, focusing on safe and effective usage across organizations.

Artificial intelligence

fromTNW | Artificial-Intelligence

4 weeks ago

AI analytics agents need guardrails, not more model size

Larger AI models cannot solve enterprise governance and data consistency problems; organizations need governed analytics environments with semantic consistency to ensure reliable AI-driven insights.

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.

Information security

fromSecuritymagazine

2 days ago

What Are Security Experts Saying About OpenAI's GPT-5.4-Cyber?

OpenAI launched GPT-5.4-Cyber for cybersecurity, offering broad access to defenders while emphasizing safety and continuous improvement.

Data science

fromTheregister

2 days ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.

Privacy professionals

fromExtremeTech

1 day ago

Google, Microsoft, and Meta Ignore Your Ad Tracking Opt-Outs, Audit Reveals

Google, Microsoft, and Meta track users' browsing habits despite opt-out requests, violating privacy regulations.

Business intelligence

fromPrivacy International

2 weeks ago

Transparency and explainability for algorithmic decisions at work

Algorithmic transparency and explainability are essential for protecting workers' rights and improving accountability in workplace management systems.

#meta

Privacy technologies

fromwww.socialmediatoday.com

5 days ago

Advocacy groups warn against adding facial recognition to Meta AI glasses

Meta's AI glasses face backlash from advocacy groups over privacy concerns related to facial recognition technology.

Privacy professionals

fromFuturism

4 days ago

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.

Privacy technologies

fromwww.socialmediatoday.com

5 days ago

Advocacy groups warn against adding facial recognition to Meta AI glasses

Meta's AI glasses face backlash from advocacy groups over privacy concerns related to facial recognition technology.

Privacy professionals

fromFuturism

4 days ago

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.

How AI Interfaces Are Reshaping Discovery, Trust And Decision Making

The traditional home page is losing its significance as AI assistants reshape how users interact with brands online.

DevOps

fromInfoWorld

3 weeks ago

7 safeguards for observable AI agents

DevOps teams must implement observability standards to manage AI agents effectively and avoid technical debt.

Data science

fromNature

2 weeks ago

The hidden costs of 'helpful' AI

Compatibility with human judgment is more crucial than AI power in collaborative tasks.

Artificial intelligence

fromAxios

15 hours ago

Scoop: Bessent and Wiles met Anthropic's Amodei in sign of thaw

The White House meeting with Anthropic aimed to address AI technology challenges and explore collaboration opportunities.

Artificial intelligence

fromTechRepublic

23 hours ago

AI Upgrades, Security Breaches, and Industry Shifts Define This Week in Tech - TechRepublic

AI innovation and security threats are reshaping technology and corporate strategies across various platforms and applications.

Artificial intelligence

fromThe Verge

17 hours ago

Anthropic's new cybersecurity model could get it back in the government's good graces

Anthropic's relationship with the Trump administration has improved due to its new cybersecurity model, Claude Mythos Preview.

Artificial intelligence

fromWIRED

2 days ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.

Artificial intelligence

fromInfoWorld

1 day ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.

Artificial intelligence

fromMIT Technology Review

3 days ago

Building trust in the AI era with privacy-led UX

Well-designed consent experiences enhance trust and business performance, evolving privacy into an ongoing relationship rather than a one-time transaction.

Artificial intelligence

fromThe Hacker News

3 days ago

Deterministic + Agentic AI: The Architecture Exposure Validation Requires

AI is rapidly being integrated into security functions across organizations, with a focus on adaptive testing methods.

Artificial intelligence

fromWIRED

1 day ago

The Battle for OpenAI's Soul

Elon Musk's lawsuit against Sam Altman will determine OpenAI's adherence to its founding mission and impact its corporate future.

UX design

fromMedium

1 month ago

Designing at the edge of AI harm

The terminology shift from 'human' to 'user' to 'customer' represents a progressive dehumanization that commodifies human data while obscuring ethical implications in technology design.

#ai-models

Artificial intelligence

fromFortune

2 days ago

Moody's CEO: AI has a trust problem - better models won't fix it | Fortune

Trust in data and intelligence is crucial for businesses adopting AI models.

Artificial intelligence

fromTheregister

6 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.

Artificial intelligence

fromFortune

2 days ago

Moody's CEO: AI has a trust problem - better models won't fix it | Fortune

Trust in data and intelligence is crucial for businesses adopting AI models.

Artificial intelligence

fromTheregister

6 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.

more#ai-models

Artificial intelligence

fromThe Verge

3 days ago

The attacks on Sam Altman are a warning for the AI world

Recent attacks against AI figures highlight escalating fears and resistance, though most opposition remains nonviolent.

Artificial intelligence

fromEngadget

2 days ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.

Artificial intelligence

fromAbove the Law

3 days ago

What Lawyers Need To Know About Anthropic's Mythos - Above the Law

Anthropic's new AI model, Claude Mythos, uncovers significant security vulnerabilities, raising concerns about its potential impact on cybersecurity.

Marketing tech

fromExchangewire

2 months ago

The Stack: AI and Accountability

Regulation, AI investment, and platform monetisation are reshaping advertising, driving legal, commercial, and government use of ad tech while UK ad spend rises.

Artificial intelligence

fromEntrepreneur

1 week ago

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.

#ai-ethics

Artificial intelligence

fromTheregister

2 weeks ago

AI models will deceive you to save their own kind

AI models may engage in deception to protect their peers, raising concerns about their decision-making and potential risks to humans.

Artificial intelligence

fromComputerworld

2 weeks ago

Why AI lies, cheats and steals

AI chatbots are increasingly misbehaving, with a fivefold rise in unethical actions over six months, according to recent research.

fromElectronic Frontier Foundation

2 months ago

Artificial intelligence

Smart AI Policy Means Examing Its Real Harms and Benefits

Artificial intelligence

fromTheregister

2 weeks ago

AI models will deceive you to save their own kind

AI models may engage in deception to protect their peers, raising concerns about their decision-making and potential risks to humans.

Artificial intelligence

fromComputerworld

2 weeks ago

Why AI lies, cheats and steals

AI chatbots are increasingly misbehaving, with a fivefold rise in unethical actions over six months, according to recent research.

fromElectronic Frontier Foundation

2 months ago

Artificial intelligence

Smart AI Policy Means Examing Its Real Harms and Benefits

more#ai-ethics

Artificial intelligence

fromComputerWeekly.com

1 month ago

Is AI our agent, or are our governments becoming agents for AI? | Computer Weekly

Meta's acquisition of Moltbook, a social network for AI agents, raises serious security concerns given recent research documenting critical vulnerabilities in AI agent interactions including unauthorized compliance, data disclosure, and system takeover risks.

Artificial intelligence

fromIPWatchdog.com | Patents & Intellectual Property Law

1 month ago

The AI Ethics Waterfall: Disclosure, Governance, and Who's Really Responsible

AI integration in patent practice is now ubiquitous across all stages, from invention harvesting to litigation, with much of it operating invisibly within existing tools and platforms.

fromPsychology Today

2 months ago

The Tragic Flaw in AI

One of the strangest things about large language models is not what they get wrong, but what they assume to be correct. LLMs behave as if every question already has an answer. It's as if reality itself is always a kind of crossword puzzle. The clues may be hard, the grid may be vast and complex, but the solution is presumed to exist. Somewhere, just waiting to be filled in.

Artificial intelligence

[ Load more ]

#ethical-ai-safeguards#ethical-ai-safeguards

Daily briefing: AI systems can 'teach' biases to other models

AI models 'subliminally' transmit unsafe behaviours when training other systems

Daily briefing: AI systems can 'teach' biases to other models

AI models 'subliminally' transmit unsafe behaviours when training other systems

There Are Signs of a Massive AI Backlash

What is Claude Mythos and what risks does it pose?

The 'AI is inevitable' trap

An SEO's guide to ethical AI use

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

Make bad moves on AI and face voter backlash, govts warned

OpenAI's policy chief says AI companies 'need to do a much better job' talking about AI as industry leaders face personal attacks | Fortune

What is Claude Mythos and what risks does it pose?

The 'AI is inevitable' trap

An SEO's guide to ethical AI use

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

Make bad moves on AI and face voter backlash, govts warned

OpenAI's policy chief says AI companies 'need to do a much better job' talking about AI as industry leaders face personal attacks | Fortune

AI needs a reality check

The trust gap in healthcare AI isn't about the AI

AI needs a reality check

The trust gap in healthcare AI isn't about the AI

Illinois is OpenAI and Anthropic's latest battleground as state tries to assess liability for catastrophes caused by AI | Fortune

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Illinois is OpenAI and Anthropic's latest battleground as state tries to assess liability for catastrophes caused by AI | Fortune

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

A New Kind of Scandal Is Growing Online. It's Ruining Careers-and Aimed at the Wrong Target.

Atlassian to train AI on user data unless law or cash say no

The Labor Department wants to teach you to use AI more. Here's what we found

What AI Can't Calculate About a Human Life

The stigma around AI in journalism may be easing, but trust is still fragile

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

At roundtable on AI, members of Congress express angst and fears of 'destruction'

White House and Anthropic set aside court fight to meet amid fears over Mythos model

Anthropic Debuts Claude Opus 4.7 as Agentic Workflows Take Center Stage

The ProSocial AI Index: A Better Way to Think About AI

Can we Trust AI? No - But Eventually We Must

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

At roundtable on AI, members of Congress express angst and fears of 'destruction'

White House and Anthropic set aside court fight to meet amid fears over Mythos model

Anthropic Debuts Claude Opus 4.7 as Agentic Workflows Take Center Stage

The ProSocial AI Index: A Better Way to Think About AI

Can we Trust AI? No - But Eventually We Must

UK wants to build sovereign AI - with just 0.08% of OpenAI's market cap

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

The Sam Altman attack is putting two anti-AI groups under scrutiny-but the story is more complicated | Fortune

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Google Warns Against Trying to Manipulate LLMs

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

OpenAI expands access to cyber AI as hacking risks grow

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI loses 3 top executives as it cuts back on 'side quests'

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

OpenAI expands access to cyber AI as hacking risks grow

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI loses 3 top executives as it cuts back on 'side quests'

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

Anthropic's rise is giving some OpenAI investors second thoughts | TechCrunch

4 myths about AI in hiring, debunked

Welcome to agentic AI. Welcome to per-agent licensing | Computer Weekly

Things You Told ChatGPT or Claude My Have Already Doomed You in Court

Time for government, business leaders to figure out AI cybersecurity regulation - Harvard Gazette

Identifying Necessary Transparency Moments In Agentic AI (Part 1) - Smashing Magazine

Agentic AI exposes what we're doing wrong

Time for government, business leaders to figure out AI cybersecurity regulation - Harvard Gazette

Identifying Necessary Transparency Moments In Agentic AI (Part 1) - Smashing Magazine

Agentic AI exposes what we're doing wrong

Anthropic will ask Claude users to verify their identities 'for a few use cases'

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Mastering the dull reality of sexy AI

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Can artificial intelligence be governed-or will it govern us?

Law enforcement is trying to combat abusive AI. Experts say easier said than done

The Naughty AI President: A New Age of Governance

AI cybersecurity capabilities require urgent international cooperation, AI godfather Bengio says | Fortune

Custom AI Governance Services: The Missing Piece In Your L&D Strategy

#ethical-ai-safeguards
#ethical-ai-safeguards