#reasoning-tasks
#reasoning-tasks

[ follow ]

#ai-models #machine-learning #coding-performance #zhipu-ai #ai #deepmind #llm #questbench

Artificial intelligence

GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities

Zhipu AI launched GLM-4.5 and GLM-4.5-Air, AI models for reasoning, coding, and agent tasks with a dual-mode system.

Artificial intelligence

Google DeepMind Introduces QuestBench to Evaluate LLMs in Solving Logic and Math Problems

DeepMind's QuestBench benchmark helps evaluate LLMs' capability to ask crucial clarifying questions for solving logic and math problems.

[ Load more ]