Show simple item record

dc.contributor.authorGreene, Derek
dc.contributor.authorCunningham, Padraig
dc.date.accessioned2008-01-29T10:41:54Z
dc.date.available2008-01-29T10:41:54Z
dc.date.issued2006-05-02
dc.identifier.citationGreene, Derek; Cunningham, Padraig. 'Efficient Prediction-Based Validation for Document Clustering'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2006-22, 2006, pp17en
dc.identifier.otherTCD-CS-2006-22
dc.description.abstractRecently, stability-based techniques have emerged as a very promising solution to the problem of cluster validation. An inherent drawback of these approaches is the computational cost of generating and assessing multiple clusterings of the data. In this paper we present an efficient prediction-based validation approach suitable for application to large, high-dimensional datasets such as text corpora. We use kernel clustering to isolate the validation procedure from the original data. Furthermore, we employ a prototype reduction strategy that allows us to work on a reduced kernel matrix, leading to significant computational savings. To ensure that this condensed representation accurately reflects the cluster structures in the data, we propose a density-biased selection strategy. This novel validation process is evaluated on a large number of real and artificial datasets, where it is shown to consistently produce good estimates for the optimal number of clusters.en
dc.format.extent344900 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoenen
dc.publisherTrinity College Dublin, Department of Computer Scienceen
dc.relation.ispartofseriesComputer Science Technical Reporten
dc.relation.ispartofseriesTCD-CS-2006-22en
dc.relation.haspartTCD-CS-[no.]en
dc.subjectComputer Scienceen
dc.titleEfficient Prediction-Based Validation for Document Clusteringen
dc.typeTechnical Reporten
dc.identifier.rssurihttps://www.cs.tcd.ie/publications/tech-reports/reports.06/TCD-CS-2006-22.pdf
dc.identifier.urihttp://hdl.handle.net/2262/13501


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record