Show simple item record

dc.contributor.authorGraham, Yvetteen
dc.date.accessioned2021-03-28T15:22:22Z
dc.date.available2021-03-28T15:22:22Z
dc.date.created16/11/20en
dc.date.issued2020en
dc.date.submitted2020en
dc.identifier.citationYvette Graham; BarryHaddow; Philipp Koehn, Statistical Power and Translationese in Machine Translation Evaluation, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Virtual, 16/11/20, Association for Computational Linguistics, 2020, 72 - 81en
dc.identifier.otherYen
dc.descriptionPUBLISHEDen
dc.descriptionVirtualen
dc.description.abstractThe term translationese has been used to describe features of translated text, and in this paper, we provide detailed analysis of potential adverse effects of translationese on machine translation evaluation. Our analysis shows differences in conclusions drawn from evaluations that include translationese in test data compared to experiments that tested only with text originally composed in that language. For this reason we recommend that reverse-created test data be omitted from future machine translation test sets. In addition, we provide a re-evaluation of a past machine translation evaluation claiming human-parity of MT. One important issue not previously considered is statistical power of significance tests applied to comparison of human and machine translation. Since the very aim of past evaluations was investigation of ties between human and MT systems, power analysis is of particular importance, to avoid, for example, claims of human parity simply corresponding to Type II error resulting from the application of a low powered test. We provide detailed analysis of tests used in such evaluations to provide an indication of a suitable minimum sample size for future studies.en
dc.format.extent72en
dc.format.extent81en
dc.language.isoenen
dc.publisherAssociation for Computational Linguisticsen
dc.rightsYen
dc.titleStatistical Power and Translationese in Machine Translation Evaluationen
dc.title.alternativeProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)en
dc.typeConference Paperen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/ygrahamen
dc.identifier.rssinternalid226533en
dc.identifier.doihttp://dx.doi.org/10.18653/v1/2020.emnlp-main.6en
dc.rights.ecaccessrightsopenAccess
dc.subject.TCDThemeDigital Engagementen
dc.subject.TCDThemeInclusive Societyen
dc.subject.TCDThemeInternational Integrationen
dc.subject.TCDTagArtificial Intelligenceen
dc.subject.TCDTagMachine Translationen
dc.subject.TCDTagNatural Language Processingen
dc.identifier.rssurihttps://www.aclweb.org/anthology/2020.emnlp-main.6.pdfen
dc.identifier.orcid_id0000-0001-6741-4855en
dc.subject.darat_thematicCommunicationen
dc.status.accessibleNen
dc.contributor.sponsorSFI stipenden
dc.contributor.sponsorGrantNumber13/RC/2106en
dc.identifier.urihttp://hdl.handle.net/2262/95911


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record