fromHackernoon
11 months agoRules, Exceptions, and Exploration: The Secret to EXPLORER's Success | HackerNoon
In our experiments, we evaluated the models based on the number of steps taken by the agent - #steps (lower is better) and the normalized scores - n. score (higher is better).
Artificial intelligence