Artificial intelligence
fromTechCrunch
14 hours agoAre AI agents ready for the workplace? A new benchmark raises doubts. | TechCrunch
AI models currently fail to reliably perform complex multi-domain white-collar tasks, answering correctly less than 25% of professional queries.