Researchers Learn to Measure AI's Language Skills | HackerNoonThe study standardizes NLPre evaluation using CoNLL 2018 metrics, focusing on F1 and AlignedAccuracy for consistency.