Show simple item record

dc.contributor.author: Graham, Yvette
dc.date.accessioned: 2021-03-28T15:36:39Z
dc.date.available: 2021-03-28T15:36:39Z
dc.date.issued: 2019
dc.date.submitted: 2019
dc.identifier.citation: Alan Smeaton, Yvette Graham, Kevin McGuinness, Noel O'Connor, Seán Quinn, Eric Arazo Sanchez, The Impact of Training Data Bias on Automatic Generation of Video Captions, Proceedings of the 25th International Conference on MultiMedia Modeling, Thessaloniki, Greece, 2019, pp. 178-190
dc.identifier.other: Y
dc.description: PUBLISHED
dc.description: Thessaloniki, Greece
dc.description.abstract: A major issue in machine learning is the availability of training data. While this historically referred to the availability of a sufficient volume of training data, it has recently shifted to the availability of sufficient unbiased training data. In this paper we focus on the effect of training data bias on an emerging multimedia application, the automatic captioning of short video clips. We use subsets of the same training data to generate different models for video captioning using the same machine learning technique, and we evaluate the performance of these models on a well-known video captioning benchmark, TRECVid. We train using the MSR-VTT video-caption pairs, pruning the captions for each video so that the remaining set is more homogeneously similar, or more diverse, or pruning at random. We then assess the effectiveness of caption-generation models trained with these variations using automatic metrics as well as direct assessment by human assessors. Our findings are preliminary and show that randomly pruning captions from the training data yields the worst performance, and that pruning to make the data more homogeneous, or more diverse, improves performance slightly compared to random pruning. Our work points to the need for more training data: both more video clips and, more importantly, more captions for those videos.
dc.format.extent: 178
dc.format.extent: 190
dc.language.iso: en
dc.rights: Y
dc.subject: Video-to-language
dc.subject: Video captioning
dc.subject: Video understanding
dc.subject: Semantic similarity
dc.title: The Impact of Training Data Bias on Automatic Generation of Video Captions
dc.title.alternative: Proceedings of the 25th International Conference on MultiMedia Modeling
dc.title.alternative: 25th International Conference on MultiMedia Modeling
dc.type: Conference Paper
dc.type.supercollection: scholarly_publications
dc.type.supercollection: refereed_publications
dc.identifier.peoplefinderurl: http://people.tcd.ie/ygraham
dc.identifier.rssinternalid: 226540
dc.identifier.doi: http://dx.doi.org/10.1007/978-3-030-05710-7_15
dc.rights.ecaccessrights: openAccess
dc.subject.TCDTheme: Creative Technologies
dc.subject.TCDTag: Natural Language Processing
dc.identifier.orcid_id: 0000-0001-6741-4855
dc.subject.darat_thematic: Communication
dc.subject.darat_thematic: Globalization
dc.status.accessible: N
dc.contributor.sponsor: SFI stipend
dc.contributor.sponsorGrantNumber: 12/RC/2289
dc.identifier.uri: http://hdl.handle.net/2262/95914
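
The abstract above describes three caption-pruning strategies (homogeneous, diverse, random) applied to each video's caption set. The following minimal Python sketch illustrates the idea only; it is not the authors' implementation, and the token-overlap similarity is a hypothetical stand-in for whatever semantic-similarity measure the paper actually uses.

    import random

    def token_overlap(a, b):
        # Jaccard overlap of lowercased word sets: a hypothetical stand-in
        # for the paper's semantic-similarity measure (not specified here).
        sa, sb = set(a.lower().split()), set(b.lower().split())
        return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

    def prune_captions(captions, keep, strategy, sim=token_overlap):
        # Reduce one video's caption set to `keep` captions using one of
        # the three strategies described in the abstract.
        if strategy == "random":
            return random.sample(captions, keep)
        # Score each caption by its mean similarity to the rest of the set.
        scores = []
        for i, c in enumerate(captions):
            others = [sim(c, o) for j, o in enumerate(captions) if j != i]
            scores.append(sum(others) / len(others))
        # "homogeneous" keeps the most mutually similar captions;
        # "diverse" keeps the least similar ones.
        order = sorted(range(len(captions)), key=scores.__getitem__,
                       reverse=(strategy == "homogeneous"))
        return [captions[i] for i in order[:keep]]

For example, since MSR-VTT supplies roughly 20 captions per clip, pruning a clip's captions down to a more diverse half might look like prune_captions(captions, keep=10, strategy="diverse").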

