#knowledge-work-benchmarks
#knowledge-work-benchmarks

[ follow ]

OpenAI launches GPT-5.4: reasoning, coding, and computer use in one

GPT-5.4 surpasses GPT-5.2 in reasoning, coding, and computer control tasks, achieving 83% performance parity with human professionals on knowledge work benchmarks and 75% success on computer use tasks.

Artificial intelligence

fromFortune

5 months ago

OpenAI aims to show its not falling behind its rivals with GPT-5.2 release | Fortune

OpenAI released GPT-5.2, claiming substantial performance gains across knowledge-work, coding, and mathematical reasoning benchmarks amid intense competition and internal resource shifts.

[ Load more ]

#knowledge-work-benchmarks#knowledge-work-benchmarks

OpenAI launches GPT-5.4: reasoning, coding, and computer use in one

OpenAI aims to show its not falling behind its rivals with GPT-5.2 release | Fortune

#knowledge-work-benchmarks
#knowledge-work-benchmarks