#model-inconsistency

[ follow ]
Artificial intelligence
fromFast Company
3 days ago

Researchers studied how Google, OpenAI, Anthropic, and DeepSeek identify hate speech. Here's how they vary

Leading AI moderation models vary widely in identifying hate speech, producing inconsistent, unpredictable, and potentially unfair moderation outcomes.
[ Load more ]