Show simple item record

dc.contributor.author: Harte, Naomi
dc.date.accessioned: 2019-10-08T08:53:39Z
dc.date.available: 2019-10-08T08:53:39Z
dc.date.issued: 2018
dc.date.submitted: 2018 [en]
dc.identifier.citation: Roddy, M., Skantze, G., Harte, N. Multimodal continuous turn-taking prediction using multiscale RNNs, ICMI’18, October 16-20, 2018, Boulder, CO, USA [en]
dc.identifier.other: Y
dc.description.abstract: In human conversational interactions, turn-taking exchanges can be coordinated using cues from multiple modalities. To design spoken dialog systems that can conduct fluid interactions it is desirable to incorporate cues from separate modalities into turn-taking models. We propose that there is an appropriate temporal granularity at which modalities should be modeled. We design a multiscale RNN architecture to model modalities at separate timescales in a continuous manner. Our results show that modeling linguistic and acoustic features at separate temporal rates can be beneficial for turn-taking modeling. We also show that our approach can be used to incorporate gaze features into turn-taking models. [en]
dc.format.extent: 186-190 [en]
dc.language.iso: en [en]
dc.rights: Y [en]
dc.subject: Spoken dialog systems [en]
dc.subject: Turn-taking modeling [en]
dc.title: Multimodal continuous turn-taking prediction using multiscale RNNs [en]
dc.title.alternative: ICMI 2018 - 20th ACM International Conference on Multimodal Interaction [en]
dc.type: Conference Paper [en]
dc.type.supercollection: scholarly_publications [en]
dc.type.supercollection: refereed_publications [en]
dc.identifier.peoplefinderurl: http://people.tcd.ie/nharte
dc.identifier.rssinternalid: 205162
dc.identifier.doi: http://dx.doi.org/10.1145/3242969.3242997
dc.rights.ecaccessrights: openAccess
dc.contributor.sponsor: Science Foundation Ireland [en]
dc.contributor.sponsorGrantNumber: 13/RC/210 [en]
dc.identifier.uri: http://hdl.handle.net/2262/89623

