
"Alibaba Cloud's latest AI model, Qwen3-Max-Thinking, is staking a claim as one of the world's most advanced reasoning engines after posting benchmark results that delivered competitive results against leading models from Google and OpenAI. In a blog post, Alibaba said the model was trained using expanded capacity and large-scale computing resources, including reinforcement learning, which led to improvements in factual accuracy, reasoning, instruction following, alignment with human preferences, and agent-style capabilities."
""On 19 established benchmarks, it demonstrates performance comparable to leading models such as GPT-5.2-Thinking, Claude-Opus-4.5, and Gemini 3 Pro," the company said. Alibaba said it has added two key upgrades to Qwen3-Max-Thinking: adaptive tool use that lets the model retrieve information or run code as needed, and test-time scaling techniques that it says deliver stronger reasoning performance than Google's Gemini 3 Pro on selected benchmarks."
Alibaba Cloud released Qwen3-Max-Thinking, an advanced reasoning model trained with expanded capacity, large-scale computing resources, and reinforcement learning. The model shows improved factual accuracy, reasoning, instruction following, alignment with human preferences, and agent-style capabilities. On 19 established benchmarks the model demonstrated performance comparable to GPT-5.2-Thinking, Claude-Opus-4.5, and Gemini 3 Pro. Two key upgrades include adaptive tool use for information retrieval and code execution, and test-time scaling techniques that reportedly outperform Gemini 3 Pro on selected reasoning benchmarks. Analysts advise cautious evaluation across domain-specific tasks, customization, scalability, and efficiency on Alibaba Cloud versus other cloud platforms.
Read at Computerworld
Unable to calculate read time
Collection
[
|
...
]