The NKJP1M subcorpus of the National Corpus of Polish is a key resource for training Polish POS taggers because it is balanced across topics and genres.
We convert NKJP1M to the CoNLL-X and CoNLL-U formats, making it compatible with modern NLPre tools while preserving the detailed annotation of the original dataset.
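As a rough illustration of what the CoNLL-U side of the conversion looks like to a consumer, the sketch below reads CoNLL-U text into per-sentence lists of token dictionaries using only the Python standard library. The ten column names follow the CoNLL-U specification; the function name `parse_conllu`, the `SAMPLE` sentence, and its tags are assumptions made up for this example and are not taken from NKJP1M or from the conversion code.

```python
# Minimal CoNLL-U reader (illustrative sketch, not the authors' conversion code).
# Column order follows the CoNLL-U specification.
CONLLU_FIELDS = (
    "id", "form", "lemma", "upos", "xpos",
    "feats", "head", "deprel", "deps", "misc",
)


def parse_conllu(text):
    """Return a list of sentences; each sentence is a list of token dicts."""
    sentences = []
    current = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            # Sentence-level comments (e.g. "# text = ...") are skipped here.
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 tab-separated columns, got {len(cols)}")
        current.append(dict(zip(CONLLU_FIELDS, cols)))
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample sentence; the tags are illustrative only.
SAMPLE = "\n".join([
    "# text = Ala ma kota.",
    "\t".join(["1", "Ala", "Ala", "PROPN", "subst:sg:nom:f", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "ma", "mieć", "VERB", "fin:sg:ter:imperf", "_", "0", "root", "_", "_"]),
    "\t".join(["3", "kota", "kot", "NOUN", "subst:sg:acc:m2", "_", "2", "obj", "_", "SpaceAfter=No"]),
    "\t".join(["4", ".", ".", "PUNCT", "interp", "_", "2", "punct", "_", "_"]),
    "",
])

if __name__ == "__main__":
    for token in parse_conllu(SAMPLE)[0]:
        print(token["form"], token["upos"], token["xpos"])
```

In this sketch the fine-grained, language-specific tag sits in the `xpos` column next to the coarse `upos` tag, which is one way a CoNLL-U file can carry detailed annotation alongside a universal tag.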