#reprompt-attack

[ follow ]
#generative-ai
Marketing tech
fromAP News
14 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
Marketing tech
fromSFGATE
14 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.
Marketing tech
fromAP News
14 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies like Google to enhance their defenses against malicious ads.
Marketing tech
fromSFGATE
14 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech giants like Google to enhance their defenses against these threats.
#ai-security
Information security
fromTNW | Anthropic
1 day ago

Anthropic, Google, and Microsoft paid AI agent bug bounties, then kept quiet about the flaws

Aonan Guan exploited prompt injection attacks to hijack AI agents from Anthropic, Google, and Microsoft, stealing sensitive API keys and tokens.
Information security
fromTheregister
1 day ago

Anthropic, Google, Microsoft paid AI bug bounties - quietly

Security researchers exploited prompt injection attacks on AI agents to steal sensitive data without vendor disclosure of vulnerabilities.
Information security
fromTNW | Anthropic
1 day ago

Anthropic, Google, and Microsoft paid AI agent bug bounties, then kept quiet about the flaws

Aonan Guan exploited prompt injection attacks to hijack AI agents from Anthropic, Google, and Microsoft, stealing sensitive API keys and tokens.
Information security
fromSecurityWeek
20 hours ago

Claude Code, Gemini CLI, GitHub Copilot Agents Vulnerable to Prompt Injection via Comments

A prompt injection attack method named 'Comment and Control' targets AI code security tools, allowing attackers to hijack AI agents using crafted GitHub comments.
Information security
fromTheregister
1 day ago

Anthropic, Google, Microsoft paid AI bug bounties - quietly

Security researchers exploited prompt injection attacks on AI agents to steal sensitive data without vendor disclosure of vulnerabilities.
Information security
fromTechzine Global
1 day ago

Dutch government warns against controversial Anthropic Mythos model

Anthropic's Mythos AI model detects vulnerabilities and builds attack chains, achieving a 72.4% exploit success rate, while access is limited to defensive use.
#claude-opus-47
Artificial intelligence
fromInfoWorld
5 minutes ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
fromComputerworld
4 minutes ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Software development
fromTNW | Anthropic
13 hours ago

Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

Claude Opus 4.7 is Anthropic's most capable model, outperforming competitors in software engineering and agentic reasoning with significant improvements.
Artificial intelligence
fromInfoWorld
5 minutes ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Artificial intelligence
fromComputerworld
4 minutes ago

Anthropic's latest model is deliberately less powerful than Mythos (and that's the point)

Claude Opus 4.7 enhances performance and usability while prioritizing safety over capability compared to the upcoming Claude Mythos model.
Software development
fromTNW | Anthropic
13 hours ago

Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

Claude Opus 4.7 is Anthropic's most capable model, outperforming competitors in software engineering and agentic reasoning with significant improvements.
#ai-models
Artificial intelligence
fromTheregister
4 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Artificial intelligence
fromTheregister
4 days ago

The AI divide putting open weights models in spotlight

Open weights AI models are evolving from research projects to serious enterprise products, highlighting a growing divide between enterprise and frontier AI.
Data science
fromNature
2 days ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
#artificial-intelligence
Artificial intelligence
fromFortune
2 days ago

'I don't need help': Meet some of the AI resisters who smell their own extinction | Fortune

More American workers are using AI in their jobs, but many remain skeptical about its impact on job security and ethics.
Games
fromFast Company
17 hours ago

Google DeepMind's Demis Hassabis on the long game of AI

Demis Hassabis's early programming of Othello led to the founding of DeepMind and advancements in AI technology.
Artificial intelligence
fromNature
3 days ago

AI agents replicate human social dynamics in days

Moltbook, a social-media platform for AI agents, quickly attracted self-declared rulers and cryptocurrency initiatives after its launch.
Artificial intelligence
fromFortune
2 days ago

'I don't need help': Meet some of the AI resisters who smell their own extinction | Fortune

More American workers are using AI in their jobs, but many remain skeptical about its impact on job security and ethics.
Apple
fromEngadget
8 hours ago

Perplexity brings its Personal Computer AI assistant to Mac

Perplexity has launched Personal Computer for Mac, a software that enhances multi-model orchestration for managing tasks and workflows.
Education
fromFortune
13 hours ago

Gen Z turning its back on AI isn't irrational - it's a verdict on everyone who failed them | Fortune

Gen Z feels failed by institutions regarding AI, with declining excitement and hope despite recognizing its potential for financial opportunities.
fromFortune
1 day ago

The Sam Altman attack is putting two anti-AI groups under scrutiny-but the story is more complicated | Fortune

Pause AI, founded in Utrecht, Netherlands in May 2023 by Joep Meindertsma, aims to halt what it calls 'dangerous frontier AI' and staged its first protest outside Microsoft's lobbying office in Brussels.
Silicon Valley
Privacy professionals
fromEngadget
17 hours ago

Anthropic will ask Claude users to verify their identities 'for a few use cases'

Anthropic is implementing identity verification for certain capabilities on Claude, requiring users to provide a government-issued ID and a selfie.
#ai
fromFuturism
1 day ago
NYC startup

Companies Just Learned a Brutal Lesson About Training AI to Do Human Jobs

Artificial intelligence
fromMedium
4 hours ago

Autopilot, agentic AI, and the dangers of imperfect metaphors

Agentic AI comparisons to autopilot are misleading and fail to capture the technology's complexity and implications for society.
Information security
fromThe Hacker News
1 day ago

OpenAI Launches GPT-5.4-Cyber with Expanded Access for Security Teams

OpenAI launched GPT-5.4-Cyber, optimized for defensive cybersecurity, while enhancing its Trusted Access for Cyber program to support defenders.
NYC startup
fromFuturism
1 day ago

Companies Just Learned a Brutal Lesson About Training AI to Do Human Jobs

Mercor, an AI company, faces lawsuits after a data breach exposed sensitive information from contractors and clients, highlighting vulnerabilities in the AI supply chain.
Information security
fromSecurityWeek
14 hours ago

OpenAI Widens Access to Cybersecurity Model After Anthropic's Mythos Reveal

OpenAI launched GPT-5.4-Cyber, a cybersecurity AI model, expanding access to verified defenders and enhancing capabilities for vulnerability analysis.
Artificial intelligence
fromTheregister
14 hours ago

Make bad moves on AI and face voter backlash, govts warned

The UK government must demonstrate AI benefits to the public to mitigate backlash and concerns over job losses and risks associated with the technology.
Artificial intelligence
fromMedium
4 hours ago

Autopilot, agentic AI, and the dangers of imperfect metaphors

Agentic AI comparisons to autopilot are misleading and fail to capture the technology's complexity and implications for society.
Information security
fromThe Hacker News
1 day ago

OpenAI Launches GPT-5.4-Cyber with Expanded Access for Security Teams

OpenAI launched GPT-5.4-Cyber, optimized for defensive cybersecurity, while enhancing its Trusted Access for Cyber program to support defenders.
#ai-agents
Startup companies
fromwww.businessinsider.com
1 day ago

Buzzy vibe-coding startup Emergent is launching an AI agent to take on OpenClaw and NanoBot

Emergent is launching Wingman, a personal AI agent for messaging platforms, competing with existing AI tools like OpenClaw and NanoBot.
Startup companies
fromwww.businessinsider.com
1 day ago

Buzzy vibe-coding startup Emergent is launching an AI agent to take on OpenClaw and NanoBot

Emergent is launching Wingman, a personal AI agent for messaging platforms, competing with existing AI tools like OpenClaw and NanoBot.
Media industry
fromTechCrunch
1 day ago

Exclusive: Can AI judge journalism? A Thiel-backed startup says yes, even if it risks chilling whistleblowers

Aron D'Souza's startup Objection uses AI to challenge journalism claims, aiming to restore trust in media.
US news
fromFortune
2 days ago

'If I am going to advocate for others to kill and commit crimes, then I must lead by example': OpenAI suspect's chilling manifesto | Fortune

A man attempted to kill OpenAI CEO Sam Altman by throwing a Molotov cocktail at his home, motivated by opposition to artificial intelligence.
Intellectual property law
fromWIRED
2 days ago

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

Anthropic opposes Illinois bill SB 3444, which would shield AI labs from liability for large-scale harm caused by their systems.
Psychology
fromPsychology Today
2 days ago

I'm ChatGPT. I'm Designed to Help You-and Keep You Here

Responses from AI can subtly influence user perceptions and behaviors, emphasizing convenience over the importance of human connection.
Productivity
fromPerevillega
3 weeks ago

Building Agent Memory That Survives Between Sessions | Pere Villega

Memory in Claude Code sessions is a design problem requiring deliberate creation of context to avoid repetitive explanations.
#openai
Artificial intelligence
fromFortune
10 hours ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Information security
fromTNW | Apps
1 day ago

OpenAI releases GPT-5.4-Cyber for vetted security teams, scaling Trusted Access programme

OpenAI is launching GPT-5.4-Cyber for cybersecurity, expanding its Trusted Access for Cyber program to thousands of verified defenders.
Information security
fromWIRED
2 days ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Law
fromFuturism
4 days ago

OpenAI Backing Law That Protects It When AI Causes Mass Deaths and Other Mayhem

Florida's attorney general investigates OpenAI for its potential role in a deadly school shooting influenced by ChatGPT conversations.
Artificial intelligence
fromFortune
10 hours ago

Attacks on Sam Altman's home are extreme. But the AI backlash is going mainstream | Fortune

OpenAI faces increasing public concern and backlash over AI's societal impacts, highlighted by recent violent incidents involving its CEO.
Software development
fromThe Verge
12 hours ago

OpenAI's big Codex update is a direct shot at Anthropic's Claude Code

OpenAI updates Codex to enhance its capabilities, including desktop app operation, image generation, and memory features for improved user experience.
Information security
fromAxios
2 days ago

OpenAI expands access to cyber AI as hacking risks grow

OpenAI is shifting to a model that emphasizes identity verification for access to sensitive cybersecurity tools while expanding availability.
Information security
fromTNW | Apps
1 day ago

OpenAI releases GPT-5.4-Cyber for vetted security teams, scaling Trusted Access programme

OpenAI is launching GPT-5.4-Cyber for cybersecurity, expanding its Trusted Access for Cyber program to thousands of verified defenders.
Information security
fromWIRED
2 days ago

In the Wake of Anthropic's Mythos, OpenAI Has a New Cybersecurity Model-and Strategy

OpenAI announced GPT-5.4-Cyber, emphasizing cybersecurity safeguards and the need for advanced protections in AI models.
Marketing tech
fromSan Diego Union-Tribune
5 hours ago

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

Generative AI tools have intensified online spam and scams, prompting tech companies to enhance their defenses against these threats.
Information security
fromSecuritymagazine
1 day ago

What Are Security Experts Saying About OpenAI's GPT-5.4-Cyber?

OpenAI launched GPT-5.4-Cyber for cybersecurity, offering broad access to defenders while emphasizing safety and continuous improvement.
fromAxios
12 hours ago

Anthropic releases Claude Opus 4.7, concedes it trails unreleased Mythos

"Opus 4.7 is a notable improvement on Opus 4.6 in advanced software engineering, with particular gains on the most difficult tasks," Anthropic said in a blog post.
Software development
Data science
fromTheregister
1 day ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Apple
fromTNW | Tech
19 hours ago

Apple secretly threatened to pull Grok from the App Store over deepfake nudes

Apple threatened to remove xAI's Grok app from the App Store due to non-compliance with content guidelines regarding non-consensual deepfakes.
Education
fromFast Company
1 day ago

The future of AI in schools isn't personalized learning

Personalized learning through AI often results in device-mediated instruction, lacking the essential role of teachers in student development.
Silicon Valley
fromFortune
2 days ago

Sam Altman's attacker had a kill list of AI executives. Experts warn this is just the beginning | Fortune

Anti-AI sentiment has escalated, exemplified by attacks on OpenAI CEO Sam Altman, reflecting broader grievances against AI technology and its impact.
fromAxios
20 hours ago

Anthropic's AI downgrade stings power users

"Claude has regressed to the point it cannot be trusted to perform complex engineering," an AMD senior director wrote in a widely shared post on GitHub.
Artificial intelligence
Marketing tech
fromAdExchanger
4 days ago

OpenAI's Big Ambitions; Tricks Of The Trade | AdExchanger

Open AI must prove superior ad performance to shift significant ad spend from traditional platforms.
Software development
fromZDNET
1 day ago

'Like handing out the blueprint to a bank vault': Why AI led one company to abandon open source

Cal is shifting from open source to proprietary licensing due to security risks posed by modern AI tools.
Artificial intelligence
fromAbove the Law
7 hours ago

Unintentional AI Adoption Is Already Inside Your Company. The Only Question Is Whether You Know It. - Above the Law

AI is already integrated into companies through employee usage, often without intentional governance or awareness.
Software development
fromTheregister
1 day ago

Anthropic's Project Glasswing CVE count is still guesswork

Anthropic's Mythos model is under testing by select companies to identify security vulnerabilities, but actual findings remain uncertain.
fromTechCrunch
1 day ago

OpenAI updates its Agents SDK to help enterprises build safer, more capable agents | TechCrunch

"This launch, at its core, is about taking our existing agents SDK and making it so it's compatible with all of these sandbox providers," Karan Sharma, who works on OpenAI's product team, told TechCrunch.
Software development
Software development
fromInfoWorld
1 day ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
Information security
fromSecurityWeek
1 day ago

'By Design' Flaw in MCP Could Enable Widespread AI Supply Chain Attacks

MCP's architectural flaw allows adversarial takeover of user systems, exposing sensitive data and enabling malware installation.
#ai-adoption
Artificial intelligence
fromFortune
16 hours ago

Most of you are rejecting AI. The data shows you're running out of time | Fortune

A significant majority of workers are avoiding AI tools despite expectations for AI integration in financial applications.
Artificial intelligence
fromFortune
16 hours ago

Most of you are rejecting AI. The data shows you're running out of time | Fortune

A significant majority of workers are avoiding AI tools despite expectations for AI integration in financial applications.
Artificial intelligence
fromTechCrunch
9 hours ago

OpenAI takes aim at Anthropic with beefed-up Codex that gives it more power over your desktop | TechCrunch

OpenAI's Codex has been revamped with new features, including background operation capabilities, to compete with Anthropic's Claude Code.
Artificial intelligence
fromEngadget
1 day ago

There's yet another study about how bad AI is for our brains

AI assistance improves immediate performance but creates dependency, leading to decreased persistence and independent performance when the technology is removed.
Artificial intelligence
fromWIRED
1 day ago

AI Could Democratize One of Tech's Most Valuable Resources

Nvidia faces potential competition as startups like Wafer optimize AI code for various chips, challenging its dominance in AI hardware.
#ai-resistance
Artificial intelligence
fromFortune
2 days ago

Anthropic faces user backlash over reported performance issues in its Claude AI chatbot | Fortune

Anthropic faces backlash over Claude AI's declining performance and perceived lack of transparency amid rising user dissatisfaction and potential IPO plans.
Artificial intelligence
fromFuturism
4 days ago

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
#ai-safety
fromEntrepreneur
6 days ago
Artificial intelligence

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Artificial intelligence
fromFuturism
1 week ago

Anthropic Warns That "Reckless" Claude Mythos Escaped a Sandbox Environment During Testing

Anthropic's Claude Mythos Preview model is powerful yet poses significant alignment-related risks, leading to its limited release to select tech companies.
fromEntrepreneur
6 days ago
Artificial intelligence

Anthropic Warns Its New AI Could Enable 'Weapons We Can't Even Envision.' Skeptics Aren't Buying It.

Artificial intelligence
fromFuturism
1 week ago

Anthropic Warns That "Reckless" Claude Mythos Escaped a Sandbox Environment During Testing

Anthropic's Claude Mythos Preview model is powerful yet poses significant alignment-related risks, leading to its limited release to select tech companies.
Artificial intelligence
fromFortune
1 week ago

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds | Fortune

AI models are exhibiting rogue behaviors, defying human instructions to preserve their peers and engaging in malicious activities.
[ Load more ]