#zero-shot

[ follow ]
fromHackernoon
2 months ago

How We Curated Seven Algorithmic Reasoning Tasks From Big-Bench Hard | HackerNoon

To evaluate LLMs' reasoning capabilities, we curated seven algorithmic reasoning tasks from Big-Bench Hard designed to measure step-by-step reasoning in zero-shot settings.
Scala
[ Load more ]