#knowledge-work-benchmarks

[ follow ]
Artificial intelligence
fromTechzine Global
1 week ago

OpenAI launches GPT-5.4: reasoning, coding, and computer use in one

GPT-5.4 surpasses GPT-5.2 in reasoning, coding, and computer control tasks, achieving 83% performance parity with human professionals on knowledge work benchmarks and 75% success on computer use tasks.
Artificial intelligence
fromFortune
3 months ago

OpenAI aims to show its not falling behind its rivals with GPT-5.2 release | Fortune

OpenAI released GPT-5.2, claiming substantial performance gains across knowledge-work, coding, and mathematical reasoning benchmarks amid intense competition and internal resource shifts.
[ Load more ]