ChatGPT still reigns supreme in many AI rankings, but the competition is on

from NBC News 4 months ago

No model has yet achieved a perfect score of 100 points on any benchmark. Smaug-72B recently became the first to break past an average score of 80.
NBC Newshttps://nbcnews.to/3TgS5jT

Saturation occurs when models outgrow benchmark tests, akin to moving from middle school to high school, or due to overfitting when models memorize answers. We need new benchmarks to fairly assess model capabilities.
NBC Newshttps://nbcnews.to/3TgS5jT

Read at NBC News

#ai-models #benchmark-performance #overfitting #human-evaluation

[

]

[

...

]

ChatGPT still reigns supreme in many AI rankings, but the competition is onChatGPT still reigns supreme in many AI rankings, but the competition is on Briefly

ChatGPT still reigns supreme in many AI rankings, but the competition is on
ChatGPT still reigns supreme in many AI rankings, but the competition is on
Briefly