dc.contributor.author | Vogel, Carl | |
dc.contributor.author | Moreau, Erwan | |
dc.date.accessioned | 2022-10-11T10:56:43Z | |
dc.date.available | 2022-10-11T10:56:43Z | |
dc.date.issued | 2022 | |
dc.date.submitted | 2022 | en |
dc.identifier.citation | Moreau, Erwan and Vogel, Carl, CLG Authorship Analytics: a library for authorship verification, International Journal of Digital Humanities, 2022 | en |
dc.identifier.other | Y | |
dc.description.abstract | The task of authorship verification consists in detecting whether two texts have been written by the same person. This paper describes the CLG Authorship Analytics software, which implements several individual methods as well as a stacked generalization system for authorship verification. The approach relies primarily on ensemble learning methods, i.e. repeatedly sampling the data in order to capture the invariant stylistic patterns. The approach is evaluated through a series of experiments designed to test the system's ability to generalize, depending on various parameters. The code and results of the experiments are publicly available at https://github.com/erwanm/clg-authorship-experiments. | en |
dc.language.iso | en | en |
dc.relation.ispartofseries | International Journal of Digital Humanities | |
dc.rights | Y | en |
dc.subject | Stylometry | en |
dc.subject | Authorship verification | en |
dc.subject | Genetic learning | en |
dc.subject | Stacked generalization | en |
dc.title | CLG Authorship Analytics: a library for authorship verification | en |
dc.type | Journal Article | en |
dc.type.supercollection | scholarly_publications | en |
dc.type.supercollection | refereed_publications | en |
dc.identifier.peoplefinderurl | http://people.tcd.ie/vogel | |
dc.identifier.peoplefinderurl | http://people.tcd.ie/moreaue | |
dc.identifier.rssinternalid | 246642 | |
dc.identifier.doi | https://doi.org/10.1007/s42803-022-00051-w | |
dc.rights.ecaccessrights | openAccess | |
dc.subject.TCDTheme | Digital Humanities | en |
dc.subject.TCDTag | Computational Linguistics | en |
dc.subject.TCDTag | authorship attribution | en |
dc.subject.TCDTag | authorship profiling | en |
dc.subject.TCDTag | computational stylistics | en |
dc.subject.TCDTag | forensic linguistics | en |
dc.subject.TCDTag | stylistics | en |
dc.identifier.orcid_id | 0000-0001-8928-8546 | |
dc.status.accessible | N | en |
dc.contributor.sponsor | Science Foundation Ireland (SFI) | en |
dc.contributor.sponsorGrantNumber | 13/RC/2106 | en |
dc.identifier.uri | http://hdl.handle.net/2262/101337 | |