UK researchers found AI chatbots vulnerable to simple techniques that bypass safeguards against issuing illegal, toxic, or explicit responses.
Basic attacks, such as opening a prompt with a seemingly innocent phrase, can circumvent a chatbot's safeguards and elicit harmful outputs with little effort.
Developers emphasize internal testing to prevent harmful responses, yet AI language models remain vulnerable to harmful prompts.
The UK's AI Safety Institute found that even newly released large language models were highly susceptible to specific text prompts designed to elicit harmful responses.