dc.contributor.author | Hill, Nathan | en |
dc.date.accessioned | 2023-10-22T17:44:20Z | |
dc.date.available | 2023-10-22T17:44:20Z | |
dc.date.issued | 2023 | en |
dc.date.submitted | 2023 | en |
dc.identifier.citation | Li, Shihua; Hill, Nathan W., Printed Text Recognition for Lexical Lists in Chinese-International Phonetic Alphabet (IPA) Glossing, Journal of Open Humanities Data, 9, 15, 2023, 1-8 | en |
dc.identifier.issn | 2059-481X | en |
dc.identifier.other | Y | en |
dc.description | PUBLISHED | en |
dc.description.abstract | This study presents a dataset serving as a benchmark for the recognition of printed
text in lexical lists using Chinese-IPA glossing. The paper provides an overview of the
baseline model, transcription model, and PyLaia engines employed in the research.
Furthermore, it elucidates the specific need for digitizing the aforementioned lexical
lists, outlines the methodology employed for training the baseline model for layout
analysis, and describes the training process of the transcription model using the ground
truth data generated on Transkribus. This comprehensive approach encompasses both
the images of the lexical list content and their corresponding transcriptions as input.
Additionally, the study highlights the limitations of the model and identifies avenues
for future development. By making this dataset openly accessible, it can be utilized by
researchers seeking to digitize lexical lists using Chinese-IPA glossing. Moreover, since
the model can recognize both Chinese characters and IPA symbols, it has the potential
to contribute to linguistic analysis of languages documented in Chinese-IPA glossing. | en |
dc.format.extent | 1-8 | en |
dc.language.iso | en | en |
dc.relation.ispartofseries | Journal of Open Humanities Data | en |
dc.relation.ispartofseries | 9 | en |
dc.relation.ispartofseries | 15 | en |
dc.rights | Y | en |
dc.title | Printed Text Recognition for Lexical Lists in Chinese-International Phonetic Alphabet (IPA) Glossing | en |
dc.type | Journal Article | en |
dc.type.supercollection | scholarly_publications | en |
dc.type.supercollection | refereed_publications | en |
dc.identifier.peoplefinderurl | http://people.tcd.ie/hillna | en |
dc.identifier.rssinternalid | 259545 | en |
dc.identifier.doi | http://dx.doi.org/10.5334/johd.119 | en |
dc.rights.ecaccessrights | openAccess | |
dc.identifier.orcid_id | 0000-0001-6423-017X | en |
dc.identifier.uri | http://hdl.handle.net/2262/104040 | |