1 to 2 of 2 Results
Mar 21, 2023 - Ground truth data for HTR on South Asian Scripts
Derrick, Tom; British Library, 2023, "Ground Truth transcriptions for training OCR of historical Bengali printed texts – Recognition of Early Indian Printed Documents competition - updated with improved XML coordinates", https://doi.org/10.11588/data/AIQSXL, heiDATA, V1
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transc... |
Oct 26, 2022
A collection of Ground Truth data for handwritten and printed text recognition for South Asian scripts provided by FID4SA - Specialized Information Service South Asia. Interested researchers can download the data archived here and use it as training data for their own text recogn... |