Show simple item record

dc.contributor.authorDelany, Sarah Jane
dc.contributor.authorCunningham, Padraig
dc.date.accessioned2008-01-15T11:53:56Z
dc.date.available2008-01-15T11:53:56Z
dc.date.issued2004-08
dc.identifier.citationDelany, Sarah Jane; Cunningham, Padraig. 'An Analysis of Case-Base Editing in a Spam Filtering System'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2004-29, 2004, pp14en
dc.identifier.otherTCD-CS-2004-29
dc.description.abstractBecause of the volume of spam email and its evolving nature, any deployed Machine Learning-based spam filtering system will need to have procedures for case-base maintenance. Key to this will be procedures to edit the case-base to remove noise and eliminate redundancy. In this paper we present a two stage process to do this. We present a new noise reduction algorithm called Blame-Based Noise Reduction that removes cases that are observed to cause misclassification. We also present an algorithm called Conservative Redundancy Reduction that is much less aggressive than the state-of-the-art alternatives and has significantly better generalisation performance in this domain. These new techniques are evaluated against the alternatives in the literature on four datasets of 1000 emails each (50% spam and 50% non spam).en
dc.description.sponsorshipThis research was supported by funding from Enterprise Ireland under grant no. CFTD/03/219 and funding from Science Foundation Ireland under grant no. SFI-02IN.1I111.en
dc.format.extent151485 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoenen
dc.publisherTrinity College Dublin, Department of Computer Scienceen
dc.relation.ispartofseriesComputer Science Technical Reporten
dc.relation.ispartofseriesTCD-CS-2004-29en
dc.relation.haspartTCD-CS-[no.]en
dc.subjectComputer Scienceen
dc.titleAn Analysis of Case-Base Editing in a Spam Filtering Systemen
dc.typeTechnical Reporten
dc.identifier.rssurihttps://www.cs.tcd.ie/publications/tech-reports/reports.04/TCD-CS-2004-29.pdf
dc.contributor.sponsorScience Foundation Ireland
dc.contributor.sponsorEnterprise Ireland
dc.identifier.urihttp://hdl.handle.net/2262/13258


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record