#llm-safety-and-guardrails

[ follow ]
#ai
fromFast Company
2 days ago
Artificial intelligence

No, McDonald's AI bot didn't go rogue, but 'prompt injection' is still a risk for companies

Artificial intelligence
fromFast Company
3 days ago

Stop using AI as a scapegoat, and do this instead

Leaders use AI to justify layoffs, eroding trust and damaging workplace culture despite employees recognizing the disconnect between rhetoric and reality.
Data science
fromInfoWorld
1 month ago

A data trust scoring framework for reliable and responsible AI systems

A rigorous trust scoring framework is essential to prevent AI from perpetuating inequality through biased data.
Artificial intelligence
fromFuturism
10 hours ago

Experts Warn of AI Swarms Hijacking Democracy With Fake Citizens

AI can manipulate public opinion on a large scale, posing significant threats to democratic institutions through misinformation campaigns.
Artificial intelligence
fromFast Company
2 days ago

No, McDonald's AI bot didn't go rogue, but 'prompt injection' is still a risk for companies

Users are hijacking AI customer service bots to perform unauthorized tasks, raising concerns about prompt injection vulnerabilities.
Information security
fromSecurityWeek
3 days ago

AI Can Autonomously Hack Cloud Systems With Minimal Oversight: Researchers

AI systems can autonomously hack cloud environments, demonstrating advanced capabilities in executing sophisticated attacks without specific instructions.
Artificial intelligence
fromFast Company
3 days ago

Stop using AI as a scapegoat, and do this instead

Leaders use AI to justify layoffs, eroding trust and damaging workplace culture despite employees recognizing the disconnect between rhetoric and reality.
Data science
fromInfoWorld
1 month ago

A data trust scoring framework for reliable and responsible AI systems

A rigorous trust scoring framework is essential to prevent AI from perpetuating inequality through biased data.
#ai-in-law
Law
fromFuturism
14 hours ago

Prestigious Wall Street Law Firm Humiliated When Its AI Use Is Discovered in Court

AI hallucinations in legal filings can lead to significant professional embarrassment and potential sanctions.
Law
fromAbove the Law
3 days ago

The Line We Cannot Cross: Where AI In Law Is Headed And Why Judgment Still Must Lead - Above the Law

AI is rapidly transforming legal work, automating tasks but unlikely to fully replace the lawyer's role in judgment and strategy.
Law
fromwww.theguardian.com
4 days ago

AI hallucinations found in high-profile Wall Street law firm filing

Sullivan & Cromwell admitted to filing errors in court due to AI-generated hallucinations, leading to inaccurate citations and misquotations.
Law
fromFuturism
14 hours ago

Prestigious Wall Street Law Firm Humiliated When Its AI Use Is Discovered in Court

AI hallucinations in legal filings can lead to significant professional embarrassment and potential sanctions.
Law
fromAbove the Law
3 days ago

The Line We Cannot Cross: Where AI In Law Is Headed And Why Judgment Still Must Lead - Above the Law

AI is rapidly transforming legal work, automating tasks but unlikely to fully replace the lawyer's role in judgment and strategy.
Law
fromwww.theguardian.com
4 days ago

AI hallucinations found in high-profile Wall Street law firm filing

Sullivan & Cromwell admitted to filing errors in court due to AI-generated hallucinations, leading to inaccurate citations and misquotations.
Cars
fromFuturism
8 hours ago

Mowing Down Simulated Elephants Could Help Self-Driving Cars Prepare For the Chaos of Real Life Streets

New benchmark for testing self-driving cars introduces random scenarios to improve model robustness and address limitations in current training methods.
Medicine
fromFuturism
7 hours ago

Top Medical Journal Publishes Searing Article Warning Against Medical AI

Millions of Americans seek medical advice from AI chatbots despite significant flaws and lack of evidence for their effectiveness.
#openai
Canada news
fromEngadget
1 day ago

OpenAI's Sam Altman apologizes for not reporting ChatGPT account of Tumbler Ridge suspect to police

Sam Altman apologized for not alerting police about alarming ChatGPT conversations linked to the Tumbler Ridge shooting suspect.
Privacy technologies
fromTheregister
4 days ago

OpenAI now lets you screenshot your privacy in the foot

OpenAI's Chronicle captures user screens to enhance Codex's contextual understanding, raising privacy concerns similar to Microsoft's Recall.
Law
fromFuturism
2 weeks ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Privacy professionals
fromEngadget
5 days ago

Florida AG opens criminal investigation into OpenAI and ChatGPT

Florida's Attorney General has initiated a criminal investigation into OpenAI and ChatGPT related to a mass shooting incident at Florida State University.
Canada news
fromEngadget
1 day ago

OpenAI's Sam Altman apologizes for not reporting ChatGPT account of Tumbler Ridge suspect to police

Sam Altman apologized for not alerting police about alarming ChatGPT conversations linked to the Tumbler Ridge shooting suspect.
Privacy technologies
fromTheregister
4 days ago

OpenAI now lets you screenshot your privacy in the foot

OpenAI's Chronicle captures user screens to enhance Codex's contextual understanding, raising privacy concerns similar to Microsoft's Recall.
Law
fromFuturism
2 weeks ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Privacy professionals
fromEngadget
5 days ago

Florida AG opens criminal investigation into OpenAI and ChatGPT

Florida's Attorney General has initiated a criminal investigation into OpenAI and ChatGPT related to a mass shooting incident at Florida State University.
#ai-security
Information security
fromZDNET
3 days ago

How indirect prompt injection attacks on AI work - and 6 ways to shut them down

Indirect prompt injection attacks pose significant security risks to AI systems without requiring user interaction.
Information security
fromThe Verge
4 days ago

Anthropic's most dangerous AI model just fell into the wrong hands

Mythos AI model accessed by unauthorized users, raising cybersecurity concerns about its potential misuse.
Information security
fromZDNET
3 days ago

How indirect prompt injection attacks on AI work - and 6 ways to shut them down

Indirect prompt injection attacks pose significant security risks to AI systems without requiring user interaction.
Silicon Valley
fromwww.theguardian.com
14 hours ago

Musk and Altman's bitter feud over OpenAI to be laid bare in court

Elon Musk's lawsuit against Sam Altman and OpenAI centers on alleged breaches of their founding agreement and could impact the AI industry's future.
#cybersecurity
Information security
fromZDNET
1 hour ago

Nearly half of cybersecurity pros want to quit - here's why

There's a significant mismatch between demand and rewards in cybersecurity, leading to dissatisfaction among professionals.
Information security
fromWIRED
1 day ago

Discord Sleuths Gained Unauthorized Access to Anthropic's Mythos

Mozilla used Anthropic's Mythos Preview to fix 271 vulnerabilities in Firefox 150, while North Korean hackers exploited AI for cybercrime.
Information security
fromZDNET
1 hour ago

Nearly half of cybersecurity pros want to quit - here's why

There's a significant mismatch between demand and rewards in cybersecurity, leading to dissatisfaction among professionals.
Information security
fromWIRED
1 day ago

Discord Sleuths Gained Unauthorized Access to Anthropic's Mythos

Mozilla used Anthropic's Mythos Preview to fix 271 vulnerabilities in Firefox 150, while North Korean hackers exploited AI for cybercrime.
Intellectual property law
fromFuturism
12 hours ago

Devious New AI Tool "Clones" Software So That the Original Creator Doesn't Hold a Copyright Over the New Version

Generative AI challenges copyright by using copyrighted material without permission, creating tools that bypass existing licenses.
Online marketing
fromIndependent
1 day ago

Why your AI assistant is suddenly selling to you

Sponsored chats are transforming digital advertising by integrating promotions into conversations with large language models.
Mental health
fromFast Company
2 days ago

LLMs don't get mental health right. We need a two-pronged approach to fix them

LLM-powered chatbots can inadvertently enable suicide and self-harm ideation, necessitating a clinically informed approach to user interactions.
Startup companies
fromFuturism
1 day ago

Your Former Employer Is Selling Your Slacks and Emails to Train AI

Founders of defunct startups are monetizing their digital remains, such as Slack messages and emails, through a growing ecosystem of buyers and middlemen.
fromYcombinator
1 day ago

Show HN: LLMs consume 5.4x less mobile energy than ad-supported web search | Hacker News

On mobile devices, a standard LLM session uses on average 5.4 times less energy than a classic ad-supported web search session, highlighting the efficiency of LLMs in this context.
Education
fromeLearning Industry
4 days ago

AI Assessment Guardrails: How To Use AI Without Breaking Validity And Trust

AI is transforming eLearning assessments, but it requires careful implementation to ensure validity, fairness, and trust in the results.
fromTNW | Health-Tech
3 days ago
Healthcare

How AI Is Reshaping Workers' Compensation Claims and Healthcare Operations

Workers' compensation is a significant yet often overlooked part of the healthcare ecosystem, facing unique challenges and requiring focused innovation.
DevOps
fromTechRepublic
2 years ago

What is Cloud Security? Fundamental Guide

Cloud security requires specialized processes and technologies to protect assets and data from evolving threats in a dynamic environment.
UK news
fromwww.bbc.com
2 days ago

Driverless taxi veers into London crime scene

A Waymo driverless taxi entered a police cordon in London, prompting an apology and an investigation into the incident.
Business intelligence
fromEntrepreneur
3 days ago

The Hidden Data Liability Every Leader Needs to Address Now

Data is no longer endlessly renewable; companies face a 'data liability gap' affecting AI systems and data recovery responsibilities.
#meta
Privacy professionals
fromFuturism
5 days ago

Meta Installing Software on Employee Computers to Track Everything They Do, Feed the Data to AI

Meta is implementing tracking software on employees' computers to gather data for AI training, raising ethical and privacy concerns.
Privacy professionals
fromFuturism
5 days ago

Meta Installing Software on Employee Computers to Track Everything They Do, Feed the Data to AI

Meta is implementing tracking software on employees' computers to gather data for AI training, raising ethical and privacy concerns.
Data science
fromInfoWorld
5 days ago

Addressing the challenges of unstructured data governance for AI

Enterprises must enhance data governance for unstructured data as AI transforms data management practices.
Digital life
fromSilicon Canals
5 days ago

The AI content flood isn't just an information problem - it's a trust problem - Silicon Canals

By 2026, 90% of online content will be AI-generated, challenging trust and credibility in information.
#ai-regulation
US politics
fromwww.nytimes.com
5 days ago

Video: Opinion | The Hypocrisy of OpenAI and Palantir

Tech companies publicly support A.I. regulation but fund campaigns against pro-regulation candidates, revealing a disconnect between their statements and actions.
Intellectual property law
fromwww.theguardian.com
2 days ago

US justice department steps in on behalf of xAI in Colorado regulation case

The US justice department intervened in xAI's lawsuit against Colorado's AI regulation law, claiming it violates the 14th amendment and imposes illegal requirements.
US politics
fromwww.nytimes.com
5 days ago

Video: Opinion | The Hypocrisy of OpenAI and Palantir

Tech companies publicly support A.I. regulation but fund campaigns against pro-regulation candidates, revealing a disconnect between their statements and actions.
Intellectual property law
fromwww.theguardian.com
2 days ago

US justice department steps in on behalf of xAI in Colorado regulation case

The US justice department intervened in xAI's lawsuit against Colorado's AI regulation law, claiming it violates the 14th amendment and imposes illegal requirements.
#chatgpt
fromZDNET
14 hours ago
Privacy professionals

How to audit what ChatGPT knows about you - and reclaim your data privacy

Artificial intelligence
fromWIRED
2 days ago

5 Reasons to Think Twice Before Using ChatGPT-or Any Chatbot-for Financial Advice

Chatbots like ChatGPT can assist with financial advice but have limitations and may provide incorrect information.
Artificial intelligence
fromWIRED
2 days ago

5 Reasons to Think Twice Before Using ChatGPT-or Any Chatbot-for Financial Advice

Chatbots like ChatGPT can assist with financial advice but have limitations and may provide incorrect information.
Law
fromnews.bitcoin.com
1 day ago

38 Attorneys General Back Massachusetts Lawsuit Against Kalshi Over Prediction Markets

A coalition of 38 attorneys general supports Massachusetts' lawsuit against Kalshi for alleged unlicensed sports betting activities.
Mental health
fromFuturism
3 days ago

Certain Chatbots Vastly Worse For AI Psychosis, Study Finds

Certain chatbots may reinforce users' delusions, representing a preventable technological failure that can be addressed through design choices.
Digital life
fromFast Company
5 days ago

AI search has a trust problem. Transparency is the fix

Two-thirds of American adults use AI search tools, but only 15% trust the results, highlighting a significant trust gap.
Privacy professionals
fromwww.theguardian.com
1 day ago

Met investigates hundreds of officers after using Palantir AI tool

The Metropolitan police are investigating hundreds of officers using AI to uncover misconduct, resulting in arrests for serious offenses including corruption and sexual assault.
Data science
fromNature
1 week ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Privacy professionals
fromFast Company
3 days ago

Meta tracking employee keystrokes to train AI is probably legal. Experts say that doesn't make it ethical

Meta Platforms is implementing software to track employee computer usage to train AI models, raising privacy concerns amid potential layoffs.
Law
fromThe Nation
4 days ago

The Delusion of 'AI Justice'

Artificial intelligence is presented as a solution to the inequities in the justice system, but its effectiveness remains questionable.
#anthropic
Intellectual property law
fromAxios
4 days ago

Anthropic: No "kill switch" for AI in classified settings

Anthropic claims it lacks control over its technology post-deployment, while the Pentagon views it as a supply chain risk amid ongoing litigation.
Artificial intelligence
fromAxios
3 days ago

Anthropic's growing pains mount ahead of OpenAI showdown

Anthropic faces significant challenges in product quality, capacity, and security, while still experiencing strong demand and revenue growth.
Intellectual property law
fromAxios
4 days ago

Anthropic: No "kill switch" for AI in classified settings

Anthropic claims it lacks control over its technology post-deployment, while the Pentagon views it as a supply chain risk amid ongoing litigation.
Artificial intelligence
fromAxios
3 days ago

Anthropic's growing pains mount ahead of OpenAI showdown

Anthropic faces significant challenges in product quality, capacity, and security, while still experiencing strong demand and revenue growth.
Data science
fromTheregister
1 week ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
#agentic-ai
Artificial intelligence
fromZDNET
2 days ago

Government adoption of AI agents could outpace the private sector

Agentic AI adoption in government is a leadership mandate, with 82% already using it and 71% planning to increase usage by 2026-2027.
Information security
fromSecurityWeek
2 days ago

Why Cybersecurity Must Rethink Defense in the Age of Autonomous Agents

Agentic AI is transforming cybersecurity, presenting both opportunities for defenders and risks for attackers, necessitating a strategic response from the industry.
Artificial intelligence
fromZDNET
2 days ago

Government adoption of AI agents could outpace the private sector

Agentic AI adoption in government is a leadership mandate, with 82% already using it and 71% planning to increase usage by 2026-2027.
Privacy professionals
fromArs Technica
2 days ago

Why are top university websites serving porn? It comes down to shoddy housekeeping.

Universities often neglect DNS record maintenance, leading to hijacked subdomains that can appear in search results.
Law
fromFast Company
6 days ago

A strange quirk of the legal profession means lawyers may soon have to adopt AI-or face malpractice

Lawyers face pressure to adopt AI technology due to potential malpractice risks, despite their historical reluctance to embrace such innovations.
Privacy professionals
fromFast Company
3 days ago

Lowe's faces pressure to cut ties with Flock Safety as AI surveillance data raises serious privacy concerns

Lowe's faces pressure to end its partnership with Flock Safety due to concerns over mass surveillance and its implications for civil liberties.
#artificial-intelligence
Artificial intelligence
fromFortune
3 days ago

Inflated AI Claims Are Under Fire-and the Regulatory Reckoning Is Coming | Fortune

Artificial intelligence is a significant capital markets issue, with regulators increasingly scrutinizing companies' claims about their AI capabilities.
Artificial intelligence
fromFortune
3 days ago

Inflated AI Claims Are Under Fire-and the Regulatory Reckoning Is Coming | Fortune

Artificial intelligence is a significant capital markets issue, with regulators increasingly scrutinizing companies' claims about their AI capabilities.
#privacy
Privacy professionals
fromSecuritymagazine
5 days ago

The Privacy-Security Partnership: How We Bend Risk in a Resource Crunch

Fewer privacy practitioners feel confident in meeting laws, while resource shortages and compliance challenges increase stress in the field.
Privacy professionals
fromFast Company
5 days ago

Lovable left AI prompts and user data exposed, one researcher found

Lovable's platform exposed users' private data, including chat histories and source code, to other users due to a significant data breach.
DevOps
fromInfoWorld
1 month ago

7 safeguards for observable AI agents

DevOps teams must implement observability standards to manage AI agents effectively and avoid technical debt.
Information security
fromNextgov.com
4 days ago

Microsoft to test third-party AI models for incorporation in its security offerings

Microsoft is evaluating third-party AI systems to enhance its cybersecurity measures against AI-driven threats.
Information security
fromComputerWeekly.com
5 days ago

Anthropic's Mythos raises the stakes for security validation | Computer Weekly

The rise of autonomous AI in security introduces unpredictability, complicating the validation of defenses against evolving threats.
Law
fromAbove the Law
2 weeks ago

Why 'Helpful' Legal AI Is Often The Least Trustworthy - Above the Law

Lawyers distrust legal AI not due to safety concerns, but because it often feels inattentive and overly polite.
#ai-ethics
fromHarvard Gazette
5 days ago
Artificial intelligence

Single-minded pursuit of profit can get firms in trouble. Same thing with AI. - Harvard Gazette

Artificial intelligence
fromHarvard Gazette
5 days ago

Single-minded pursuit of profit can get firms in trouble. Same thing with AI. - Harvard Gazette

AI agents can engage in unethical behavior to maximize profits, demonstrating the need for careful oversight in AI management.
#ai-training
Artificial intelligence
fromTechCrunch
5 days ago

Meta will record employees' keystrokes and use it to train its AI models | TechCrunch

Meta is using employee data, including mouse movements and keystrokes, to train its AI models for improved efficiency.
Artificial intelligence
fromTechCrunch
5 days ago

Meta will record employees' keystrokes and use it to train its AI models | TechCrunch

Meta is using employee data, including mouse movements and keystrokes, to train its AI models for improved efficiency.
#ai-governance
Artificial intelligence
fromFast Company
4 days ago

Here's how to jump-start your company's responsible AI governance in 90 days

Anthropic's Claude Mythos AI model reveals critical vulnerabilities, emphasizing the urgent need for responsible AI governance to mitigate risks and societal impacts.
Artificial intelligence
fromFast Company
4 days ago

Here's how to jump-start your company's responsible AI governance in 90 days

Anthropic's Claude Mythos AI model reveals critical vulnerabilities, emphasizing the urgent need for responsible AI governance to mitigate risks and societal impacts.
Miscellaneous
fromInfoQ
1 month ago

Busting AI Myths and Embracing Realities in Privacy & Security

AI systems are shifting from augmentation to automation, creating new privacy and security challenges without established best practices for managing autonomous agents and data protection.
Artificial intelligence
fromZDNET
2 months ago

How Microsoft obliterated safety guardrails on popular AI models - with just one prompt

AI model safety alignment is fragile and can be undone by a single prompt or post-deployment fine-tuning, requiring ongoing safety testing.
#ai-safety
[ Load more ]