dc.contributor.author: Hill, Nathan
dc.date.accessioned: 2023-10-22T17:44:20Z
dc.date.available: 2023-10-22T17:44:20Z
dc.date.issued: 2023
dc.date.submitted: 2023
dc.identifier.citation: Li, Shihua; Hill, Nathan W., Printed Text Recognition for Lexical Lists in Chinese-International Phonetic Alphabet (IPA) Glossing, Journal of Open Humanities Data, 9, 15, 2023, 1-8
dc.identifier.issn: 2059-481X
dc.identifier.other: Y
dc.description: PUBLISHED
dc.description.abstract: This study presents a dataset serving as a benchmark for the recognition of printed text in lexical lists using Chinese-IPA glossing. The paper provides an overview of the baseline model, transcription model, and PyLaia engines employed in the research. Furthermore, it elucidates the specific need for digitizing the aforementioned lexical lists, outlines the methodology employed for training the baseline model for layout analysis, and describes the training process of the transcription model using the ground truth data generated on Transkribus. This comprehensive approach encompasses both the images of the lexical list content and their corresponding transcriptions as input. Additionally, the study highlights the limitations of the model and identifies avenues for future development. By making this dataset openly accessible, it can be utilized by researchers seeking to digitize lexical lists using Chinese-IPA glossing. Moreover, since the model can recognize both Chinese characters and IPA symbols, it has the potential to contribute to linguistic analysis of languages documented in Chinese-IPA glossing.
dc.format.extent: 1-8
dc.language.iso: en
dc.relation.ispartofseries: Journal of Open Humanities Data
dc.relation.ispartofseries: 9
dc.relation.ispartofseries: 15
dc.rights: Y
dc.title: Printed Text Recognition for Lexical Lists in Chinese-International Phonetic Alphabet (IPA) Glossing
dc.type: Journal Article
dc.type.supercollection: scholarly_publications
dc.type.supercollection: refereed_publications
dc.identifier.peoplefinderurl: http://people.tcd.ie/hillna
dc.identifier.rssinternalid: 259545
dc.identifier.doi: http://dx.doi.org/10.5334/johd.119
dc.rights.ecaccessrights: openAccess
dc.identifier.orcid_id: 0000-0001-6423-017X
dc.identifier.uri: http://hdl.handle.net/2262/104040

