
"The new model, available in Instant, Thinking, and Pro performance tiers, offers major improvements across a range of benchmarks, the company said. Using OpenAI's GDPval benchmark, which compares the model's ability to complete 44 different business tasks to the same standards as human experts, GPT-5.2 matched or exceeded human users in 70.9% of tests, compared to GPT-5.1's 38.8% across the Instant (basic), Thinking (deeper reasoning), and Pro (research-grade) versions."
""We designed GPT‑5.2 to unlock even more economic value for people; it's better at creating spreadsheets, building presentations, writing code, perceiving images, understanding long contexts, using tools, and handling complex, multi-step projects," said OpenAI."
""For everyday professional use, this translates into a model that can more reliably debug production code, implement feature requests, refactor large codebases, and ship fixes end-to-end with less manual intervention," the company said."
GPT-5.2 posts substantial gains over GPT-5.1 on business and software benchmarks, according to OpenAI. The model is available in Instant, Thinking, and Pro tiers and matches or exceeds human experts on 70.9% of GDPval business tasks, versus 38.8% for GPT-5.1. OpenAI says it is better at creating spreadsheets, building presentations, writing and debugging code, perceiving images, understanding long contexts, using tools, and handling multi-step projects. Gains are also reported on the ARC-AGI and SWE-Bench suites, and the company says the model can more reliably debug production code, implement feature requests, refactor large codebases, and ship fixes end-to-end. The rollout begins with paid ChatGPT users; API pricing is $1.75 per million input tokens and $14 per million output tokens, with a 90% discount on cached input tokens.
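To make the quoted API prices concrete, here is a minimal sketch of a per-request cost estimate. The three price constants come from the summary above; the function name `estimate_cost`, its parameters, and the assumption that cached tokens are a subset of the input tokens billed at 10% of the input price are illustrative only and not confirmed by the source.

```python
from decimal import Decimal

# Prices quoted in the summary (USD per one million tokens).
INPUT_PRICE_PER_M = Decimal("1.75")
OUTPUT_PRICE_PER_M = Decimal("14")
CACHED_INPUT_DISCOUNT = Decimal("0.90")  # 90% off cached input tokens

def estimate_cost(input_tokens, output_tokens, cached_input_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_input_tokens is the portion of input_tokens
    served from cache, billed at the discounted input price.
    """
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")

    million = Decimal(1_000_000)
    uncached_tokens = input_tokens - cached_input_tokens
    cached_price_per_m = INPUT_PRICE_PER_M * (1 - CACHED_INPUT_DISCOUNT)

    total = (
        uncached_tokens * INPUT_PRICE_PER_M
        + cached_input_tokens * cached_price_per_m
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / million

if __name__ == "__main__":
    # 1M input tokens (200k of them cached) and 100k output tokens.
    print(estimate_cost(1_000_000, 100_000, cached_input_tokens=200_000))
```

Under these assumptions, output tokens dominate the bill quickly: 100,000 output tokens cost as much as 800,000 uncached input tokens.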
Read at Computerworld