Multimodal continuous turn-taking prediction using multiscale RNNs

Harte, Naomi

dc.contributor.author	Harte, Naomi
dc.date.accessioned	2019-10-08T08:53:39Z
dc.date.available	2019-10-08T08:53:39Z
dc.date.issued	2018
dc.date.submitted	2018	en
dc.identifier.citation	Roddy, M., Skantze, G., Harte, N. Multimodal continuous turn-taking prediction using multiscale RNNs, ICMI’18, October 16-20, 2018, Boulder, CO, USA	en
dc.identifier.other	Y
dc.description.abstract	In human conversational interactions, turn-taking exchanges can be coordinated using cues from multiple modalities. To design spoken dialog systems that can conduct fluid interactions it is desirable to incorporate cues from separate modalities into turn-taking models. We propose that there is an appropriate temporal granularity at which modalities should be modeled. We design a multiscale RNN architecture to model modalities at separate timescales in a continuous manner. Our results show that modeling linguistic and acoustic features at separate temporal rates can be beneficial for turn-taking modeling. We also show that our approach can be used to incorporate gaze features into turn-taking models.	en
dc.format.extent	186-190	en
dc.language.iso	en	en
dc.rights	Y	en
dc.subject	Spoken dialog systems	en
dc.subject	Turn-taking modeling	en
dc.title	Multimodal continuous turn-taking prediction using multiscale RNNs	en
dc.title.alternative	ICMI 2018 - 20th ACM International Conference on Multimodal Interaction	en
dc.type	Conference Paper	en
dc.type.supercollection	scholarly_publications	en
dc.type.supercollection	refereed_publications	en
dc.identifier.peoplefinderurl	http://people.tcd.ie/nharte
dc.identifier.rssinternalid	205162
dc.identifier.doi	http://dx.doi.org/10.1145/3242969.3242997
dc.rights.ecaccessrights	openAccess
dc.contributor.sponsor	Science Foundation Ireland	en
dc.contributor.sponsorGrantNumber	13/RC/210	en
dc.identifier.uri	http://hdl.handle.net/2262/89623

Files in this item

Name:: p186-roddy.pdf
Size:: 1.030Mb
Format:: PDF

View/Open

Name:: license.txt
Size:: 3.499Kb
Format:: Text file

View/Open

This item appears in the following Collection(s)

Electronic & Electrical Eng (Scholarly Publications)
Electronic & Electrical Eng (Scholarly Publications)
RSS Feeds

Show simple item record

Browse

My Account

Multimodal continuous turn-taking prediction using multiscale RNNs

Files in this item

This item appears in the following Collection(s)