Text Recognition for Nepalese Manuscripts in Pracalit Script
Citation:
O'Neill, Alexander James; Hill, Nathan W., Text Recognition for Nepalese Manuscripts in Pracalit Script, Journal of Open Humanities Data, 8, 26, 2022, 1-6Download Item:
Abstract:
This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines. It explains our methodology for developing the requisite ground truth consisting of manuscript images and corresponding transcriptions, training our model with a PyLAia engine, and this model’s limitations. This dataset shared on Zenodo can be used by anyone working with manuscripts in Pracalit script, which will benefit the fields of Indology and Newar studies, as well as historical and linguistic analysis.
Author's Homepage:
http://people.tcd.ie/hillnaDescription:
PUBLISHED
Author: Hill, Nathan
Type of material:
Journal ArticleSeries/Report no:
Journal of Open Humanities Data8
26
Availability:
Full text availableSubject:
handwritten text recognition, PyLAia, Transkribus, Sanskrit, Newar, ManuscriptsDOI:
http://dx.doi.org/10.5334/johd.90ISSN:
2059-481XMetadata
Show full item recordThe following license files are associated with this item: