Show simple item record

dc.contributor.authorKelleher, John
dc.date.accessioned2022-03-21T10:38:14Z
dc.date.available2022-03-21T10:38:14Z
dc.date.issued2020
dc.date.submitted2020en
dc.identifier.citationWang, Fei, Kelleher, John, Ross, Robert J., 'Update Frequency and Background Corpus Selection in Dynamic TF-IDF Models for First Story Detection', 2020, Communications in Computer and Information Science, 1215 CCISen
dc.identifier.otherY
dc.description.abstractFirst Story Detection (FSD) requires a system to detect the very first story that mentions an event from a stream of stories. Nearest neighbour-based models, using the traditional term vector document representations like TF-IDF, currently achieve the state of the art in FSD. Because of its online nature, a dynamic term vector model that is incrementally updated during the detection process is usually adopted for FSD instead of a static model. However, very little research has investigated the selection of hyper-parameters and the background corpora for a dynamic model. In this paper, we analyse how a dynamic term vector model works for FSD, and investigate the impact of different update frequencies and background corpora on FSD performance. Our results show that dynamic models with high update frequencies outperform static model and dynamic models with low update frequencies; and that the FSD performance of dynamic models does not always increase with higher update frequencies, but instead reaches steady state after some update frequency threshold is reached. In addition, we demonstrate that different background corpora have very limited influence on the dynamic models with high update frequencies in terms of FSD performance.en
dc.format.extent206-217en
dc.language.isoenen
dc.relation.ispartofseriesCommunications in Computer and Information Science;
dc.relation.ispartofseries1215 CCIS;
dc.rightsYen
dc.subjectFirst Story Detection (FSD)en
dc.subjectneighbour-based modelsen
dc.subjectNovelty Detectionen
dc.subjectNearest Neighbouren
dc.subjectTF- IDFen
dc.subjectUpdate Frequencyen
dc.subjectBackground Corpusen
dc.titleUpdate Frequency and Background Corpus Selection in Dynamic TF-IDF Models for First Story Detectionen
dc.typeJournal Articleen
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/kellehjd
dc.identifier.rssinternalid224462
dc.identifier.doihttp://dx.doi.org/10.1007/978-981-15-6168-9_18
dc.rights.ecaccessrightsopenAccess
dc.contributor.sponsorScience Foundation Irelanden
dc.contributor.sponsorGrantNumber13/RC/2106en
dc.identifier.urihttp://hdl.handle.net/2262/98317


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record