#cbrn--cyber-risks

[ follow ]
Science
fromInfoWorld
1 day ago

Get poetic in prompts and AI will break its guardrails

Adversarial poetic prompts cause diverse AI models to bypass safety and reveal harmful instructions, indicating structural alignment weaknesses across model families.
[ Load more ]