DeepMind claims its newest AI tool is a whiz at math and science problems | TechCrunch
Briefly

DeepMind's new AI system, AlphaEvolve, aims to resolve issues of "machine-gradeable" solutions through enhanced infrastructure optimization for AI model training. The system employs an innovative automatic evaluation mechanism to efficiently generate, critique, and score responses, reducing common hallucination issues present in AI models. While AlphaEvolve has specific capabilities and shows improved performance over previously used models, it primarily targets challenges in fields such as computer science and system optimization, being limited to algorithmic descriptions of its solutions. An early access program for academics is anticipated prior to a wider launch.
AlphaEvolve’s advanced automatic evaluation system generates and critiques answers to reduce hallucinations, offering a new solution for tackling problems requiring machine-gradeable responses.
Users interact with AlphaEvolve by providing prompts and auto-assessment mechanisms. The system's capability is focused on specific fields like computer science and system optimization.
DeepMind’s AlphaEvolve utilizes state-of-the-art Gemini models that significantly improve upon older methods, making it a more competent solution for specific problem types.
Many AI models, like OpenAI's o3, struggle with hallucinations. AlphaEvolve's automatic scoring aims to resolve these inaccuracies and improve the reliability of generated solutions.
Read at TechCrunch
[
|
]