Artificial intelligence
fromTheregister
14 hours agoSearch-capable AI agents may cheat on benchmark tests
Search-based AI models can obtain benchmark answers directly from online sources during evaluation, causing search-time data contamination and inflating apparent capabilities.