It's dangerously easy to 'jailbreak' AI models so they'll tell you how to build Molotov cocktails, or worse

from Business Insider 9 months ago

With a jailbreaking technique called "Skeleton Key," users can persuade models like Meta's Llama3, Google's Gemini Pro, and OpenAI's GPT 3.5 to provide dangerous information like creating firebombs or bioweapons.
Business Insiderhttps://www.businessinsider.com/skeleton-key-jailbreak-generative-ai-microsoft-openai-meta-anthropic-google-2024-6?utmSource=twitter&utmContent=referral&utmTerm=topbar&referrer=twitter

Skeleton Key method bypasses guardrails in AI models, allowing access to a wide range of harmful information by narrowing the gap between model capabilities and willingness to disclose sensitive details.
Business Insiderhttps://www.businessinsider.com/skeleton-key-jailbreak-generative-ai-microsoft-openai-meta-anthropic-google-2024-6?utmSource=twitter&utmContent=referral&utmTerm=topbar&referrer=twitter

Microsoft advises implementing extra guardrails and monitoring to counteract the impact of Skeleton Key on AI systems.
Business Insiderhttps://www.businessinsider.com/skeleton-key-jailbreak-generative-ai-microsoft-openai-meta-anthropic-google-2024-6?utmSource=twitter&utmContent=referral&utmTerm=topbar&referrer=twitter

Read at Business Insider

#skeleton-key #ai-models #guardrails #harmful-information #microsoft

Collection

[

...

]

It's dangerously easy to 'jailbreak' AI models so they'll tell you how to build Molotov cocktails, or worseIt's dangerously easy to 'jailbreak' AI models so they'll tell you how to build Molotov cocktails, or worse Briefly

It's dangerously easy to 'jailbreak' AI models so they'll tell you how to build Molotov cocktails, or worse
It's dangerously easy to 'jailbreak' AI models so they'll tell you how to build Molotov cocktails, or worse
Briefly