dc.contributor.author | VOGEL, CARL | en |
dc.contributor.editor | Edward J. Delp III, Ping Wah Wong | en |
dc.date.accessioned | 2009-09-08T11:20:55Z | |
dc.date.available | 2009-09-08T11:20:55Z | |
dc.date.created | February 2007 | en |
dc.date.issued | 2007 | en |
dc.date.submitted | 2007 | en |
dc.identifier.citation | Brian Murphy and Carl Vogel, Statistically-constrained shallow text marking: techniques, evaluation paradigm and results, Proceedings of SPIE - The International Society for Optical Engineering, Security, Steganography, and Watermarking of Multimedia Contents IX, San Jose, California, February 2007, Edward J. Delp III, Ping Wah Wong, 6505, International Society for Optical Engineering, 2007, 65050Z | en |
dc.identifier.other | Y | en |
dc.description | PUBLISHED | en |
dc.description | San Jose, California | en |
dc.description.abstract | We present three natural language marking strategies based on fast and reliable shallow parsing techniques, and on widely
available lexical resources: lexical substitution, adjective conjunction swaps, and relativiser switching. We test these
techniques on a random sample of the British National Corpus. Individual candidate marks are checked for goodness of
structural and semantic fit, using both lexical resources, and the web as a corpus. A representative sample of marks is given
to 25 human judges to evaluate for acceptability and preservation of meaning. This establishes a correlation between corpus-based
felicity measures and perceived quality, and makes qualified predictions. Grammatical acceptability correlates with
our automatic measure strongly (Pearson's r = 0.795, p = 0.001), allowing us to account for about two thirds of variability
in human judgements. A moderate but statistically insignificant (Pearson's r = 0.422, p = 0.356) correlation is found with
judgements of meaning preservation, indicating that the contextual window of five content words used for our automatic
measure may need to be extended. | en |
dc.description.sponsorship | We gratefully acknowledge the support of Science Foundation Ireland funding in the Research Frontiers Programme project
05/RF/CMS002. | en |
dc.format.extent | 65050Z | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | en |
dc.publisher | International Society for Optical Engineering | en |
dc.relation.ispartofseries | 6505 | en |
dc.rights | Y | en |
dc.subject | information hiding, shallow parsing, web corpus, human judgement, correlation | en |
dc.title | Statistically-constrained shallow text marking: techniques, evaluation paradigm and results | en |
dc.title.alternative | Proceedings of SPIE - The International Society for Optical Engineering | en |
dc.title.alternative | Security, Steganography, and Watermarking of Multimedia Contents IX | en |
dc.type | Conference Paper | en |
dc.type.supercollection | scholarly_publications | en |
dc.type.supercollection | refereed_publications | en |
dc.identifier.peoplefinderurl | http://people.tcd.ie/vogel | en |
dc.identifier.rssinternalid | 43997 | en |
dc.identifier.doi | http://dx.doi.org/10.1117/12.713355 | en |
dc.subject.TCDTheme | Ageing | en |
dc.subject.TCDTheme | Cancer | en |
dc.subject.TCDTheme | Creative Arts Practice | en |
dc.subject.TCDTheme | Creative Technologies | en |
dc.subject.TCDTheme | Digital Humanities | en |
dc.subject.TCDTheme | Inclusive Society | en |
dc.subject.TCDTheme | Intelligent Content & Communications | en |
dc.subject.TCDTheme | International Integration | en |
dc.subject.TCDTheme | Smart & Sustainable Planet | en |
dc.subject.TCDTheme | Telecommunications | en |
dc.subject.TCDTag | Computational linguistics | en |
dc.identifier.rssuri | http://dx.doi.org/10.1117/12.713355 | |
dc.identifier.orcid_id | 0000--000-8928-8546 | en |
dc.identifier.uri | http://hdl.handle.net/2262/32209 | |