Artificial intelligence
from InfoQ
1 day ago
Hugging Face Releases FineTranslations, a Trillion-Token Multilingual Parallel Text Dataset
FineTranslations provides over one trillion tokens of English-parallel data across 500+ languages to improve machine translation and supplement English model pretraining.