fromLawSites
1 week agoAI Legal Research Startup Descrybe Launches 'Legal Reasoning' Tool; Says It Outperforms ChatGPT, Claude, and Gemini on Bar Exam Benchmark
DescrybeLM answered all 200 correctly. The general-purpose models each missed between 13 and 23 questions, achieving accuracy rates ranging from 88.5% to 93.5%. Rubric-scored reasoning quality - a separate measure evaluating whether systems correctly identified governing legal rules and applied them to the facts - followed a similar pattern. DescrybeLM scored 99.70% on that dimension.
Artificial intelligence
