#red-teaming

Artificial intelligence
from Futurism
2 days ago

Anthropic's Advanced New AI Tries to Run Vending Machine, Goes Bankrupt After Ordering PlayStation 5 and Live Fish

An AI agent operating on Anthropic's Claude failed to profitably run an office vending machine, incurred losses, and was shut down after three weeks.
#ai-safety
from LogRocket Blog
5 months ago
Artificial intelligence

Stress-testing AI products: A red-teaming playbook - LogRocket Blog

AI systems reflect flaws on an industrial scale, highlighting the need for proactive red-teaming to ensure safety and compliance.
from TechCrunch
8 months ago
Artificial intelligence

OpenAI partner says it had relatively little time to test the company's newest AI models | TechCrunch

METR says the limited time it had to test OpenAI's new o3 and o4-mini models made its evaluation less comprehensive.
#ai-cybersecurity
from IT Pro
1 week ago
Artificial intelligence

OpenAI turns to red teamers to prevent malicious ChatGPT use as company warns future models could pose 'high' security risk

from Fortune
3 months ago
Artificial intelligence

Inside Anthropic's 'Red Team': ensuring Claude is safe, and that Anthropic is heard in the corridors of power

Online learning
from eLearning Industry
1 month ago

2026 Trends & Strategies Online Conference

L&D must adopt AI-driven, skills-based strategies, agile cross-functional teams, robust virtual training evaluation, and red teaming to prepare organizations for 2026.
from The Hacker News
1 month ago

From Tabletop to Turnkey: Building Cyber Resilience in Financial Services

Financial institutions are facing a new reality: cyber-resilience has gone from a best practice, to an operational necessity, to a prescriptive regulatory requirement. Crisis management or tabletop exercises, long relatively rare in cybersecurity, are now required of FSI organizations under a series of regulations in several regions, including DORA (Digital Operational Resilience Act) in the EU; CPS230 / CORIE (Cyber Operational Resilience Intelligence-led Exercises) in Australia;
Information security
#ai-security
Information security
from The Hacker News
1 month ago

Russian Ransomware Gangs Weaponize Open-Source AdaptixC2 for Advanced Attacks

AdaptixC2 is an open-source, extensible post-exploitation C2 framework with advanced features that is increasingly adopted by threat actors, including groups linked to ransomware.
Science
from Nature
2 months ago

Biothreat hunters catch dangerous DNA before it gets made

AI-enabled protein design can produce structure-preserving, sequence-diverse proteins that can bypass DNA-synthesis biosecurity screening unless screening tools are updated.
from WIRED
4 months ago

Inside the Biden Administration's Unpublished Report on AI Safety

Researchers identified 139 novel methods to cause AI systems to misbehave, including generating misinformation and leaking personal data, during a red teaming exercise.
US politics