fromInfoWorld1 week agoAI benchmarking tools evaluate real world performance"xbench evaluates models not only on the ability to pass arbitrary tests but also on the ability to execute real-world tasks, which is more unusual."Artificial intelligence