#kurdish-ocr
#kurdish-ocr

[ follow ]

Training Tesseract for Low-Resource Languages | HackerNoon

Trained Tesseract OCR on 1233 Kurdish text lines from pre-1950 documents to advance digitization of Kurdish historical materials.

Data science

fromHackernoon

1 year ago

Key Challenges in OCR Research and Future Directions | HackerNoon

Historical Kurdish documents are difficult to digitize due to scarce resources, unclear text, non-standard spacing, complex layouts, and limitations of current OCR methods.

[ Load more ]

#kurdish-ocr#kurdish-ocr

Training Tesseract for Low-Resource Languages | HackerNoon

Key Challenges in OCR Research and Future Directions | HackerNoon

#kurdish-ocr
#kurdish-ocr