#mathematical-reasoning

[ follow ]
Artificial intelligence
fromInfoQ
4 days ago

Intel DeepMath Introduces a Smart Architecture to Make LLMs Better at Math

DeepMath uses a Qwen3-4B Thinking agent that emits small Python executors for intermediate math steps, improving accuracy and significantly reducing output length.
fromNature
1 month ago

DeepSeek's self-correcting AI model aces tough maths proofs

The model, DeepSeekMath-V2, scored 118 out of 120 points on questions from the 2024 William Lowell Putnam Mathematical Competition, beating the top human score of 90. The model also performed at the level of gold-medal winners in the International Mathematical Olympiad (IMO) 2025 and the 2024 China Mathematical Olympiad. The results are described in a preprint posted on arXiv on 27 November.
Artificial intelligence
Artificial intelligence
fromNature
1 month ago

DeepSeek's self-correcting AI model aces tough maths proofs

DeepSeekMath-V2 scored 118/120 on the 2024 Putnam, surpassing top humans and using self-verifiable reasoning to detect and correct its own errors.
Artificial intelligence
fromArs Technica
1 month ago

DeepMind's latest: An AI for handling mathematical proofs

AlphaProof achieved International Mathematical Olympiad silver-level performance and nearly gold on the Putnam, demonstrating substantial advances in automated mathematical reasoning.
fromstupidDOPE | Est. 2008
5 months ago

Google's Gemini 2.5 AI Model Launches with Major Upgrades for Ultra Users | stupidDOPE | Est. 2008

Gemini 2.5 stands out from other AI offerings thanks to its multi-agent structure. This design enables the model to simulate multiple AI agents that work together to analyze, test, and refine solutions to a task.
Artificial intelligence
[ Load more ]