
dc.contributor.author: Greene, Derek
dc.contributor.author: Cunningham, Padraig
dc.date.accessioned: 2008-01-29T11:56:03Z
dc.date.available: 2008-01-29T11:56:03Z
dc.date.issued: 2006-02-07
dc.identifier.citation: Greene, Derek; Cunningham, Padraig. 'Practical Solutions to the Problem of Diagonal Dominance in Kernel Document Clustering'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2006-04, 2006, pp14 [en]
dc.identifier.other: TCD-CS-2006-04
dc.description.abstract: In supervised kernel methods, it has been observed that the performance of the SVM classifier is poor in cases where the diagonal entries of the Gram matrix are large relative to the off-diagonal entries. This problem, referred to as diagonal dominance, often occurs when certain kernel functions are applied to sparse high-dimensional data, such as text corpora. In this paper we investigate the implications of diagonal dominance for unsupervised kernel methods, specifically in the task of document clustering. We discuss a selection of strategies for addressing this issue, and evaluate their effectiveness in producing more accurate and stable clusterings. [en]
dc.format.extent: 680524 bytes
dc.format.mimetype: application/pdf
dc.language.iso: en [en]
dc.publisher: Trinity College Dublin, Department of Computer Science [en]
dc.relation.ispartofseries: Computer Science Technical Report [en]
dc.relation.ispartofseries: TCD-CS-2006-04 [en]
dc.relation.haspart: TCD-CS-[no.] [en]
dc.subject: Computer Science [en]
dc.title: Practical Solutions to the Problem of Diagonal Dominance in Kernel Document Clustering [en]
dc.type: Technical Report [en]
dc.identifier.rssuri: https://www.cs.tcd.ie/publications/tech-reports/reports.06/TCD-CS-2006-04.pdf
dc.identifier.uri: http://hdl.handle.net/2262/13518

