Investigating Entity Linking in Early English Legal Documents
Citation:
Gary Munnelly and Séamus Lawless, Investigating Entity Linking in Early English Legal Documents, ACM/IEEE Joint Conference on Digital Libraries, JCDL 2018, Fort Worth, Texas, USA, 3rd-6th June, 2018Download Item:
Abstract:
In this paper we investigate the accuracy and overall suitability of a variety of Entity Linking systems for the task of disambiguating entities in 17 th century depositions obtained during the 1641 Irish Rebellion. The depositions are extremely difficult for modern NLP tools to work with due to inconsistent spelling, use of language and archaic references. In order to assess the severity of difficulty faced by Entity Linking systems when working with these documents we use the depositions to create an evaluation corpus. This corpus is used as an input to the General Entity Annotator Benchmarking Framework, a standard benchmarking platform for entity annotation systems. Based on this corpus and the results obtained from General Entity Annotator Benchmarking Framework we observe that the accuracy of existing Entity Linking systems is lacking when applied to content like these depositions. This is due to a number of issues ranging from problems with existing state-of-the-art systems to poor representation of historic entities in modern knowledge bases. We discuss some interesting questions raised by this evaluation and put forward a plan for future work in order to learn more.
Sponsor
Grant Number
Science Foundation Ireland (SFI)
13/RC/2106
Author's Homepage:
http://people.tcd.ie/selawles
Author: Lawless, Seamus
Sponsor:
Science Foundation Ireland (SFI)Other Titles:
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2018Type of material:
Conference PaperCollections
Availability:
Full text availableSubject (TCD):
Digital Engagement , Digital Humanities , Making Ireland , Digital Humanities , Entity Linking , Knowledge and data engineering , SEMANTIC WEBMetadata
Show full item recordLicences: