These researchers used NPR Sunday Puzzle questions to benchmark AI 'reasoning' models | TechCrunchThe Sunday Puzzle serves as an effective AI benchmarking tool, revealing limitations of reasoning models in solving human-like riddles.