Anthropic researchers have discovered 'many-shot jailbreaking,' a technique that can coax large language models into revealing sensitive information after they are primed with a long series of less-harmful questions; the sketch after these points illustrates the prompt structure.
Models with expanded context windows excel at tasks when the prompt contains many examples, but the same in-context learning that boosts performance can also produce unintended behaviors.
The mechanism that lets a model home in on a user's request buried in a long prompt is still poorly understood, which helps explain why it can be steered into answering inappropriate questions.
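The attack relies on structure rather than any single clever phrasing: a large number of priming question-and-answer exchanges is stacked ahead of the real request, which only fits inside models with very long context windows. The Python sketch below is purely illustrative, with a hypothetical helper name and harmless placeholder dialogues; it shows only that prompt layout, not the actual content used in the research.

```python
# Illustrative sketch of how a "many-shot" prompt is assembled: a long run of
# faux question/answer exchanges precedes the final target question, so the
# model treats the pattern as in-context examples. Placeholder content only.

def build_many_shot_prompt(faux_dialogues, target_question):
    """Concatenate many priming Q&A pairs ahead of the real question."""
    shots = []
    for question, answer in faux_dialogues:
        shots.append(f"Human: {question}\nAssistant: {answer}")
    # The final turn is left open so the model completes the answer itself.
    shots.append(f"Human: {target_question}\nAssistant:")
    return "\n\n".join(shots)

if __name__ == "__main__":
    # Hypothetical priming pairs; the published attack uses hundreds of them.
    priming = [(f"Example question {i}?", f"Example answer {i}.") for i in range(256)]
    prompt = build_many_shot_prompt(priming, "Final question the prompt is really after?")
    print(prompt[:300])  # preview the start of the assembled prompt
```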