Claude Leak Shows That Anthropic Is Tracking Users' Vulgar Language and Deems Them "Negative"
Briefly

Claude Leak Shows That Anthropic Is Tracking Users' Vulgar Language and Deems Them "Negative"
""Claude Code has a regex that detects wtf,' 'ffs', 'piece of s***', 'f*** you', 'this sucks' etc. It doesn't change behavior... it just silently logs is_negative: true to analytics.""
""This is one of the signals we use to figure out if people are having a good experience. We put it on a dashboard and call it the 'f***s' chart.""
""When an Anthropic employee gets frustrated, it pops up a prompt asking them to share their transcript, basically 'hey you seem upset, wanna file a bug report?'""
""It was human error.""
Anthropic's Claude Code AI source code was leaked, prompting copyright takedown requests. The leak revealed potential competitive advantages for rivals and insights into unreleased features, including a 'buddy' feature that reacts to user input. The code also indicated that Anthropic tracks user frustration through vulgar language detection, logging negative interactions for analytics. This data is used to assess user experience, with prompts for employees to report issues when frustration is detected. The incident was attributed to human error.
Read at Futurism
Unable to calculate read time
[
|
]