Show simple item record

dc.contributor.authorGraham, Yvette
dc.date.accessioned2021-04-20T12:21:21Z
dc.date.available2021-04-20T12:21:21Z
dc.date.created11/12/16en
dc.date.issued2016
dc.date.submitted2016en
dc.identifier.citationGraham, Yvette, Baldwin, Timothy, Dowling, Meghan , Eskevich, Maria, Lynn, Teresa and Tounsi, Lamia (2016) Is all that glitters in MT quality estimation really gold standard? In: 26th International Conference on Computational Linguistics, 11-17 Dec 2016, Osaka, Japanen
dc.identifier.isbn978-4-87974-702-0
dc.identifier.otherY
dc.description.abstractHuman-targeted metrics provide a compromise between human evaluation of machine translation, where high inter-annotator agreement is difficult to achieve, and fully automatic metrics, such as BLEU or TER, that lack the validity of human assessment. Human-targeted translation edit rate (HTER) is by far the most widely employed human-targeted metric in machine translation, commonly employed, for example, as a gold standard in evaluation of quality estimation. Original experiments justifying the design of HTER, as opposed to other possible formulations, were limited to a small sample of translations and a single language pair, however, and this motivates our re-evaluation of a range of human-targeted metrics on a substantially larger scale. Results show significantly stronger correlation with human judgment for HBLEU over HTER for two of the nine language pairs we include and no significant difference between correlations achieved by HTER and HBLEU for the remaining language pairs. Finally, we evaluate a range of quality estimation systems employing HTER and direct assessment (DA) of translation adequacy as gold labels, resulting in a divergence in system rankings, and propose employment of DA for future quality estimation evaluations.en
dc.format.extent3124-3134en
dc.language.isoenen
dc.rightsYen
dc.subjectMachine Learningen
dc.titleIs all that glitters in MT quality estimation really gold standard?en
dc.title.alternativeProceedings of the 26th International Conference on Computational Linguistics (COLING)en
dc.title.alternative26th International Conference on Computational Linguistics (COLING)en
dc.typeConference Paperen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/ygraham
dc.identifier.rssinternalid227712
dc.rights.ecaccessrightsopenAccess
dc.identifier.orcid_id0000-0001-6741-4855
dc.identifier.urihttps://www.aclweb.org/anthology/C16-1
dc.identifier.urihttp://hdl.handle.net/2262/96107


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record