
"The research evaluated chatbots on hallucination rate, customer ratings, response consistency, and downtime rate. The chatbots were then assigned a reliability risk score from 0 to 99, with higher scores indicating bigger problems. Grok achieved an 8% hallucination rate, 4.5 customer rating, 3.5 consistency, and 0.07% downtime, resulting in an overall risk score of just 6. DeepSeek followed closely with 14% hallucinations and zero downtime for a stellar risk score of 4. ChatGPT's high hallucination and downtime rates gave it the top risk score of 99, followed by Claude and Meta AI, which earned reliability risk scores of 75 and 70, respectively."
"About 65% of US companies now use AI chatbots in their daily work, and nearly 45% of employees admit they've shared sensitive company information with these tools. These numbers show well how important chatbots have become in everyday work. Dependence on AI tools will likely increase even more, so companies should choose their chatbots based on how reliable and fit they are for their specific business needs. A chatbot that everyone uses isn't necessarily the one that works best for your industry or gives accurate answer"
Grok achieved an 8% hallucination rate, a 4.5 customer rating, 3.5 response consistency, 0.07% downtime, and an overall reliability risk score of 6. ChatGPT registered a 35% hallucination rate and high downtime, earning the highest risk score of 99, while Google's Gemini recorded a 38% hallucination rate. DeepSeek posted 14% hallucinations with zero downtime and a risk score of 4. Claude and Meta AI received risk scores of 75 and 70 respectively. Approximately 65% of US companies use AI chatbots and nearly 45% of employees have shared sensitive company information with these tools.
Read at TESLARATI
Unable to calculate read time
Collection
[
|
...
]