Government Test Finds That AI Wildly Underperforms Compared to Human Employees

from Futurism 7 months ago

The trial revealed generative AI's summary capabilities are significantly inferior to human capabilities, with scores of 47% for AI versus 81% for human summaries.
Futurismhttps://futurism.com/the-byte/government-ai-worse-summarizing

Using Meta's Llama2-70B model, the AI struggled to meet the expectations set for summarizing documents, casting doubt on its practical applications in business.
Futurismhttps://futurism.com/the-byte/government-ai-worse-summarizing

The findings highlight a prevalent concern about generative AI's reliability, raising questions about its utility for most organizations in workplace settings.
Futurismhttps://futurism.com/the-byte/government-ai-worse-summarizing

Evaluators in the trial reported a strong perception of the AI outputs, confirming the challenges in distinguishing between human and AI-generated summaries.
Futurismhttps://futurism.com/the-byte/government-ai-worse-summarizing

Read at Futurism

#generative-ai #summarization #human-vs-ai #australian-securities-and-investment-commission #meta-llama2

Collection

[

...

]

Government Test Finds That AI Wildly Underperforms Compared to Human EmployeesGovernment Test Finds That AI Wildly Underperforms Compared to Human Employees Briefly

Government Test Finds That AI Wildly Underperforms Compared to Human Employees
Government Test Finds That AI Wildly Underperforms Compared to Human Employees
Briefly