Show simple item record

dc.contributor.author: Maldonado Guerra, Alfredo
dc.contributor.author: QasemiZadeh, Behrang
dc.contributor.editor: S. Markantonatou, C. Ramisch, A. Savary, V. Vincze
dc.identifier.isbn: 978-3-96110-123-8
dc.date.accessioned: 2019-12-19T15:16:34Z
dc.date.available: 2019-12-19T15:16:34Z
dc.date.issued: 2018
dc.date.submitted: 2018
dc.identifier.citation: Maldonado, A. & QasemiZadeh, B., 'Analysis and Insights from the PARSEME Shared Task dataset', in S. Markantonatou, C. Ramisch, A. Savary & V. Vincze (eds), Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop, Language Science Press, 2018, pp. 149–176
dc.identifier.isbn: 978-3-96110-124-5
dc.identifier.other: Y
dc.description: PUBLISHED
dc.description.abstract: The PARSEME Shared Task on the automatic identification of verbal multiword expressions (VMWEs) was the first collaborative study on the subject to cover a wide and diverse range of languages. One observation that emerged from the official results is that participating systems performed similarly on each language but differently across languages. That is, intra-language evaluation scores are relatively similar, whereas inter-language scores differ considerably. We hypothesise that this pattern cannot be attributed solely to the intrinsic linguistic properties of each language corpus, but also to more practical aspects such as the evaluation framework, the characteristics of the test and training sets, and the metrics used for measuring performance. This chapter takes a close look at the shared task dataset and the systems' output to explain this pattern. In the process, we produce evaluation results for the systems on VMWEs that appear only in the test set and contrast them with the official evaluation results, which include VMWEs that also occur in the training set. Additionally, we conduct an analysis aimed at estimating the relative difficulty of VMWE detection for each language. This analysis consists of a) assessing the impact on performance of the systems' ability, or lack thereof, to handle discontinuous and overlapping VMWEs, b) measuring the relative sparsity of sentences with at least one VMWE, and c) interpreting the performance of each system with respect to two baseline systems: a system that simply tags every verb as a VMWE, and a dictionary-lookup system. Based on our data analysis, we assess the suitability of the official evaluation methods, specifically the token-based method, and propose to use Cohen's kappa score as an additional evaluation method.
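
As a rough illustration of the metric proposed in the abstract (this is not code from the chapter), the following Python sketch computes Cohen's kappa over token-level binary VMWE tags, comparing a system's labeling against the gold standard; the token labels below are invented for illustration.

from collections import Counter

def cohens_kappa(gold, pred):
    """Cohen's kappa for two equal-length sequences of labels."""
    assert len(gold) == len(pred) and gold
    n = len(gold)
    # Observed agreement: fraction of tokens where the two labelings match.
    p_o = sum(g == p for g, p in zip(gold, pred)) / n
    # Chance agreement: computed from each labeling's marginal label distribution.
    gold_counts, pred_counts = Counter(gold), Counter(pred)
    p_e = sum(gold_counts[label] * pred_counts[label] for label in gold_counts) / n ** 2
    return (p_o - p_e) / (1 - p_e)

# Hypothetical token-level tags: 1 = token belongs to a VMWE, 0 = it does not.
gold = [0, 1, 1, 0, 0, 1, 0, 0]
pred = [0, 1, 0, 0, 0, 1, 0, 0]
print(round(cohens_kappa(gold, pred), 2))  # 0.71 on this toy example

Unlike raw token-level agreement, kappa discounts the agreement expected by chance, so a degenerate labeling (for example, one driven mostly by the majority "not a VMWE" tag) gains comparatively little credit.
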
dc.format.extent: 149
dc.format.extent: 176
dc.language.iso: en
dc.publisher: Language Science Press
dc.relation.ispartof: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop
dc.relation.uri: http://langsci-press.org/catalog/book/204
dc.rights: Y
dc.subject: Verbal multiword expressions
dc.subject: PARSEME
dc.subject: Languages
dc.title: Analysis and Insights from the PARSEME Shared Task dataset
dc.title.alternative: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop
dc.type: Book Chapter
dc.type.supercollection: scholarly_publications
dc.type.supercollection: refereed_publications
dc.identifier.peoplefinderurl: http://people.tcd.ie/maldona
dc.identifier.rssinternalid: 193089
dc.identifier.doi: 10.5281/zenodo.1469557
dc.rights.ecaccessrights: openAccess
dc.relation.doi: 10.5281/zenodo.1469527
dc.subject.TCDTag: Data Analysis
dc.subject.TCDTag: Natural Language Processing
dc.identifier.rssuri: http://langsci-press.org/catalog/view/204/1345/1301-1
dc.identifier.orcid_id: 0000-0001-8426-5249
dc.status.accessible: N
dc.contributor.sponsor: Science Foundation Ireland (SFI)
dc.contributor.sponsorGrantNumber: 13/RC/2106
dc.identifier.uri: http://hdl.handle.net/2262/91209

