Show simple item record

dc.contributor.author: VOGEL, CARL [en]
dc.contributor.editor: Edward J. Delp III, Ping Wah Wong [en]
dc.date.accessioned: 2009-09-08T11:20:55Z
dc.date.available: 2009-09-08T11:20:55Z
dc.date.created: February 2007 [en]
dc.date.issued: 2007 [en]
dc.date.submitted: 2007 [en]
dc.identifier.citation: Brian Murphy and Carl Vogel, Statistically-constrained shallow text marking: techniques, evaluation paradigm and results, Proceedings of SPIE - The International Society for Optical Engineering, Security, Steganography, and Watermarking of Multimedia Contents IX, San Jose, California, February 2007, Edward J. Delp III, Ping Wah Wong, 6505, International Society for Optical Engineering, 2007, 65050Z [en]
dc.identifier.other: Y [en]
dc.description: PUBLISHED [en]
dc.description: San Jose, California [en]
dc.description.abstract: We present three natural language marking strategies based on fast and reliable shallow parsing techniques, and on widely available lexical resources: lexical substitution, adjective conjunction swaps, and relativiser switching. We test these techniques on a random sample of the British National Corpus. Individual candidate marks are checked for goodness of structural and semantic fit, using both lexical resources, and the web as a corpus. A representative sample of marks is given to 25 human judges to evaluate for acceptability and preservation of meaning. This establishes a correlation between corpus based felicity measures and perceived quality, and makes qualified predictions. Grammatical acceptability correlates with our automatic measure strongly (Pearson's r = 0.795, p = 0.001), allowing us to account for about two thirds of variability in human judgements. A moderate but statistically insignificant (Pearson's r = 0.422, p = 0.356) correlation is found with judgements of meaning preservation, indicating that the contextual window of five content words used for our automatic measure may need to be extended. [en]
dc.description.sponsorship: We gratefully acknowledge the support of Science Foundation Ireland funding in the Research Frontiers Programme project 05/RF/CMS002. [en]
dc.format.extent: 65050Z [en]
dc.format.mimetype: application/pdf
dc.language.iso: en [en]
dc.publisher: International Society for Optical Engineering [en]
dc.relation.ispartofseries: 6505 [en]
dc.rights: Y [en]
dc.subject: information hiding, shallow parsing, web corpus, human judgement, correlation [en]
dc.title: Statistically-constrained shallow text marking: techniques, evaluation paradigm and results [en]
dc.title.alternative: Proceedings of SPIE - The International Society for Optical Engineering [en]
dc.title.alternative: Security, Steganography, and Watermarking of Multimedia Contents IX [en]
dc.type: Conference Paper [en]
dc.type.supercollection: scholarly_publications [en]
dc.type.supercollection: refereed_publications [en]
dc.identifier.peoplefinderurl: http://people.tcd.ie/vogel [en]
dc.identifier.rssinternalid: 43997 [en]
dc.identifier.doi: http://dx.doi.org/10.1117/12.713355 [en]
dc.subject.TCDTheme: Ageing [en]
dc.subject.TCDTheme: Cancer [en]
dc.subject.TCDTheme: Creative Arts Practice [en]
dc.subject.TCDTheme: Creative Technologies [en]
dc.subject.TCDTheme: Digital Humanities [en]
dc.subject.TCDTheme: Inclusive Society [en]
dc.subject.TCDTheme: Intelligent Content & Communications [en]
dc.subject.TCDTheme: International Integration [en]
dc.subject.TCDTheme: Smart & Sustainable Planet [en]
dc.subject.TCDTheme: Telecommunications [en]
dc.subject.TCDTag: Computational linguistics [en]
dc.identifier.rssuri: http://dx.doi.org/10.1117/12.713355
dc.identifier.orcid_id: 0000--000-8928-8546 [en]
dc.identifier.uri: http://hdl.handle.net/2262/32209
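
Note on the abstract's correlation figures: the "about two thirds of variability" statement can be checked with a short calculation. This is a sketch that assumes the figure refers to the coefficient of determination r^2 of a simple linear relationship; the record itself does not say how the share was computed.

```latex
% Assumption: "variability accounted for" is read as r^2 (coefficient of determination).
\begin{align*}
  r_{\text{acceptability}} &= 0.795 &\Rightarrow\quad r^2 &= 0.795^2 \approx 0.632 \\
  r_{\text{meaning}}       &= 0.422 &\Rightarrow\quad r^2 &= 0.422^2 \approx 0.178
\end{align*}
```

Under that reading, the automatic measure explains roughly 63% of the variance in the grammatical acceptability judgements, in line with the abstract's "about two thirds", and roughly 18% for meaning preservation, where the correlation was not statistically significant.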

