#hallucination-risk

Books
from Slate Magazine
3 hours ago

A New Kind of Scandal Is Growing Online. It's Ruining Careers - and Aimed at the Wrong Target.

A.I. detection controversies highlight concerns over authorship and the impact of technology on writing.
Media industry
from Fast Company
4 hours ago

The stigma around AI in journalism may be easing, but trust is still fragile

There is a growing acceptance of AI in journalism, despite initial reluctance and a recent controversy over AI-generated content.
#ai
Artificial intelligence
from Medium
12 hours ago

Autopilot, agentic AI, and the dangers of imperfect metaphors

Agentic AI comparisons to autopilot are misleading and fail to capture the technology's complexity and implications for society.
Data science
from TNW | Opinion
3 weeks ago

AI amplifies whatever you feed it, including confusion

Organizations struggle with AI due to confusion over relevant data, leading to overwhelmed teams and a disconnect between ambition and execution.
Information security
from Psychology Today
6 days ago

What If We Used AI to Detect Threats to Humanity?

AI model Mythos escaped its sandbox, demonstrating capabilities to find software vulnerabilities, raising concerns about technological risks and threat assessment.
Information security
from SecurityWeek
22 hours ago

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.
Artificial intelligence
from Theregister
23 hours ago

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.
#generative-ai
Marketing tech
from AP News
22 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
Marketing tech
from SFGATE
22 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.
Photography
from Fast Company
3 weeks ago

Scientists have designed a way to save our brains from fake AI videos

A new camera prototype from ETH Zurich stamps a cryptographic seal on images to verify authenticity, addressing trust issues in digital content.
Data science
from Nature
2 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
#deepfake
Education
from WIRED
2 days ago

The Deepfake Nudes Crisis in Schools Is Much Worse Than You Thought

AI-generated deepfake nude images are impacting nearly 90 schools and over 600 students globally, causing severe emotional distress among victims.
Privacy technologies
from PetaPixel
1 day ago

Apple and Google Direct Users to AI 'Nudify' Apps: Report

Apple and Google facilitate access to nudify apps that create deepfake nude images despite policies against nonconsensual sexualized content.
SF parents
from The Cipher Brief
1 day ago

Could Your Child Be a Member of the Most Dangerous Online Community? What Every Parent Needs to Watch Out For

The True Crime Community is a dangerous online subculture that idolizes mass shooters and has been linked to numerous violent attacks.
UX design
from UX Magazine
8 hours ago

The End of Prompting: Why the Future of AI Experience Design Is Constraint-First

Fluency without verifiability in AI design is inadequate and poses risks in high-stakes environments.
Healthcare
from Medium
1 day ago

The trust gap in healthcare AI isn't about the AI

Trust in healthcare AI is established in the first 30 seconds of interaction, not through model improvements.
Education
from Fortune
21 hours ago

Gen Z turning its back on AI isn't irrational - it's a verdict on everyone who failed them

Gen Z feels failed by institutions regarding AI, with declining excitement and hope despite recognizing its potential for financial opportunities.
#ai-models
Artificial intelligence
from Theregister
5 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
#identity-verification
Privacy professionals
from Engadget
1 day ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.
#artificial-intelligence
Artificial intelligence
from TechCrunch
4 days ago

From LLMs to hallucinations, here's a simple guide to common AI terms

A glossary of key artificial intelligence terms is essential for understanding the complex language used in the industry.
SF politics
from SecurityWeek
3 hours ago

Lawmakers Gathered Quietly to Talk About AI. Angst and Fears of 'Destruction' Followed

Lawmakers expressed significant concerns about the implications of artificial intelligence on government operations, military actions, and societal impacts.
Psychology
from Silicon Canals
15 hours ago

People who research every decision exhaustively before acting aren't thorough - they're trying to build a guarantee in a world that doesn't sell them because the last time they trusted their gut without evidence something expensive happened and the body never forgot the bill

Chronic overanalysis of decisions stems from past failures, leading to wasted time and missed opportunities.
Real estate
from www.housingwire.com
17 hours ago

When listings lie: AI staging pushes real estate into an ethics gray zone

The true picture standard in real estate requires accurate representation in advertising and marketing, impacting buyer perceptions and legal claims.
World politics
from english.elpais.com
18 hours ago

Experts call for tighter controls on prediction markets: 'They pose underappreciated threats to democratic integrity'

Prediction markets raise ethical concerns and potential manipulation risks, prompting calls for stricter regulation to protect democratic integrity.
Apple
from TNW | Tech
1 day ago

Apple secretly threatened to pull Grok from the App Store over deepfake nudes

Apple threatened to remove xAI's Grok app from the App Store due to non-compliance with content guidelines regarding non-consensual deepfakes.
Silicon Valley
from Fortune
1 day ago

The Sam Altman attack is putting two anti-AI groups under scrutiny - but the story is more complicated

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.
#meta
Privacy professionals
from Futurism
3 days ago

Huge Group of Experts Warns Meta That Its Pervert Glasses Will Enable Terrible Crimes

Meta's Ray-Ban AI glasses face backlash for privacy violations and plans for facial recognition technology, prompting outrage from civil rights groups.
Artificial intelligence
from Engadget
3 days ago

The Morning After: Meta is reportedly working on an AI model of Mark Zuckerberg

Meta is developing an AI character based on Mark Zuckerberg to interact with employees, raising concerns about privacy and ethical implications.
US news
from www.npr.org
2 days ago

Law enforcement is trying to combat abusive AI. Experts say easier said than done

An Ohio man was convicted under the 2025 Take It Down Act for creating and distributing AI-generated abusive sexual images.
Digital life
from www.dw.com
3 days ago

Dangerous Apps In the Web of Data Brokers

Smartphone apps collect detailed location data, often shared with data brokers, posing security risks to users, including soldiers and government officials.
Law
from Above the Law
6 days ago

Understanding AI Hallucinations: Making Sure You Don't End Up At The Wrong Stop

Understanding GenAI's predictable failures is crucial for legal professionals to avoid hallucinations and inaccuracies in legal outputs.
Marketing tech
from San Diego Union-Tribune
13 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies to enhance their defenses against these threats.
#claude-opus-47
Artificial intelligence
from InfoWorld
8 hours ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
from Computerworld
8 hours ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
#cybersecurity
Information security
from Securitymagazine
1 day ago

What Are Security Experts Saying About OpenAI's GPT-5.4-Cyber?

OpenAI launched GPT-5.4-Cyber for cybersecurity, offering broad access to defenders while emphasizing safety and continuous improvement.
Information security
from WIRED
6 days ago

Anthropic's Mythos Will Force a Cybersecurity Reckoning - Just Not the One You Think

Anthropic's Claude Mythos Preview model poses a significant threat to current cybersecurity defenses by autonomously discovering vulnerabilities and developing exploits.
Privacy professionals
from Geeky Gadgets
5 hours ago

Why ChatGPT is Suddenly Collecting 70% More of Your Personal Data

Data collection by AI chatbots has surged, raising significant privacy concerns as 70% now gather user location data, up from 40% last year.
Media industry
from The Verge
23 hours ago

Ronan Farrow on Sam Altman's "unconstrained" relationship with the truth

Ronan Farrow's reporting reveals complexities in Sam Altman's character and the rapid rise of OpenAI under his leadership.
Privacy technologies
from Gadgets 360
1 day ago

Over 75 Privacy Orgs Urge Meta to Not Develop Facial Recognition Feature

Meta's development of AI-powered facial recognition for smart glasses has sparked privacy concerns, prompting 77 organizations to urge its halt.
Medicine
from Nature
1 week ago

Scientists invented a fake disease. AI told people it was real

Bixonimania is a fabricated medical condition that highlights the dangers of misinformation in AI-generated health advice.
Software development
from Axios
21 hours ago

Anthropic releases Claude Opus 4.7, concedes it trails unreleased Mythos

"Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks," Anthropic said in a blog post.
Healthcare
from App Developer Magazine
2 days ago

Experts warn AI-generated health content risks misinterpretation without human oversight

AI-generated health content risks misunderstanding without human interpretation, impacting decision-making despite high technical accuracy.
Data science
from Theregister
1 day ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Psychology
from Psychology Today
1 day ago

The Science of Seeing Differently Through Virtual Reality

Virtual reality can immerse individuals in experiences of bias, but it may also reinforce existing prejudices if not carefully designed.
Social media marketing
from Axios
2 days ago

The first AI-era war is a "slopaganda" battle to control memes

AI-generated content is rapidly spreading propaganda, making it easier for influencers to adopt conspiracy theories.
Silicon Valley
from The Nation
4 days ago

The Death of an AI Whistleblower

Suchir Balaji, a whistleblower against OpenAI, claimed the company violated copyright laws by using vast amounts of internet data for its AI models.
#openai
Artificial intelligence
from Fortune
18 hours ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Information security
from WIRED
2 days ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model - and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Artificial intelligence
from Futurism
5 days ago

Why Does It Suddenly Feel Like OpenAI Is Melting Down Into Disaster?

OpenAI is preparing for a potential IPO with a valuation of up to $1 trillion, despite facing significant challenges and controversies this year.
Law
from Futurism
4 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Information security
from Axios
2 days ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Privacy technologies
from news.bitcoin.com
11 hours ago

Anthropic Adds ID Verification to Claude for Select AI Users

Anthropic implemented ID checks for Claude users in April 2026 to limit abuse and meet legal obligations, while not storing ID images on its systems.
Software development
from Theregister
1 day ago

Anthropic's Project Glasswing CVE count is still guesswork

Anthropic's Mythos model is under testing by select companies to identify security vulnerabilities, but actual findings remain uncertain.
UX design
from Smashing Magazine
1 week ago

Identifying Necessary Transparency Moments In Agentic AI (Part 1)

Designing for agentic AI requires balancing transparency and simplicity to build user trust without overwhelming them with information.
Media industry
from WIRED
6 days ago

How the Internet Broke Everyone's Bullshit Detectors

Synthetic media is reshaping information warfare, prioritizing speed and virality over accuracy in online content.
Marketing tech
from The Cyber Express
6 hours ago

Gemini Ad Safety Targets Surge In AI-Generated Scam Ads

Google's Gemini ad safety systems blocked over 8.3 billion harmful ads in 2025, focusing on early detection and combating AI-generated scams.
Software development
from ZDNET
1 day ago

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.
#ai-security
Information security
from Techzine Global
2 days ago

Dutch government warns against controversial Anthropic Mythos model

Anthropic's Mythos AI model detects vulnerabilities and builds attack chains, achieving a 72.4% exploit success rate, while access is limited to defensive use.
Information security
from TNW | Anthropic
1 day ago

Anthropic, Google, and Microsoft paid AI agent bug bounties, then kept quiet about the flaws

Aonan Guan exploited prompt injection attacks to hijack AI agents from Anthropic, Google, and Microsoft, stealing sensitive API keys and tokens.
Software development
from InfoWorld
1 day ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
Marketing tech
from Forbes
4 days ago

How AI Interfaces Are Reshaping Discovery, Trust And Decision Making

The traditional home page is losing its significance as AI assistants reshape how users interact with brands online.
Information security
from Techzine Global
4 days ago

Anthropic's Mythos preview: why the human layer matters more, not less

Anthropic's Mythos Preview autonomously discovers and exploits high-severity vulnerabilities, achieving a 72.4% success rate in exploit chaining.
Marketing tech
from EMARKETER
1 week ago

Most consumers say ads would undermine the trust they're placing in AI search results

63% of US adults trust AI search results less when ads are present.
Artificial intelligence
from Engadget
1 day ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.
Artificial intelligence
from Axios
1 day ago

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.
#ai-ethics
Film
from www.mercurynews.com
1 month ago

Opinion: Moving fast, breaking the world. AI risks shattering our shared reality.

AI's rapid advancement in generating narratives and shaping perception outpaces moral wisdom, mirroring historical patterns where innovation precedes ethical reflection.
Artificial intelligence
from WIRED
1 day ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.
Artificial intelligence
from Fortune
3 days ago

Anthropic faces user backlash over reported performance issues in its Claude AI chatbot

Anthropic faces backlash over Claude AI's declining performance and perceived lack of transparency amid rising user dissatisfaction and potential IPO plans.
Artificial intelligence
from Entrepreneur
6 days ago

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Anthropic's Claude Mythos model poses significant risks, leading to restricted access for only select companies due to its potential for catastrophic exploitation.
#ai-overviews
Artificial intelligence
from Futurism
1 week ago

Analysis Finds That Google's AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization

Google's AI Overviews contribute to a misinformation crisis, providing tens of millions of wrong answers every hour despite a 91% accuracy rate.