Games provide a clear, unambiguous signal of success. Their structured nature and measurable outcomes make them the perfect testbed for evaluating models and agents. They force models to demonstrate many skills including strategic reasoning, long-term planning, and dynamic adaptation against an intelligent opponent, providing a robust signal of their general problem-solving intelligence.