dc.contributor.author | Gobl, Christer | |
dc.date.accessioned | 2025-02-16T15:32:00Z | |
dc.date.available | 2025-02-16T15:32:00Z | |
dc.date.issued | 2024 | |
dc.date.submitted | 2024 | en |
dc.identifier.citation | Giovannini, A. M., Wang, Z., O'Reilly, M., Ní Chasaide, A., Gobl, C., Voice transforms for affect control in Irish speech synthesis, Speech Prosody 2024, Leiden, the Netherlands, 2024, 299 - 303 | en |
dc.identifier.other | Y | |
dc.description.abstract | This paper reports on an experiment using voice transforms to
alter the perceived affect in synthetic utterances of Irish, with a
view to controlling affect in the spoken output of an Irish AAC
device. The transforms were guided by prior experience and by
voice source analyses of utterances by a male speaker with an
angry, happy, sad, bored, relaxed and neutral voice. The neu-
tral utterance was modified to incorporate stylised voice trans-
forms targeting these affects. Modifications included global
shifts affecting the entire utterance, local shifts affecting only
accented syllables, and a combination of global and local
changes. Stimuli targeting sad and happy included tempo
changes and formant shifts were included for happy. Listeners’
evaluations most positively identified the high activation affects
happy and angry. Stimuli targeting sad were also effective,
while those targeting bored and relaxed were not, although
bored was positively associated with some of the sad-targeting
stimuli. Results for low activations states are confounded by the
fact that the neutral stimulus was to some degree biased
towards bored, sad and relaxed affects. Of the three types of
transforms, global, local and combined, the most effective
appears to vary with the targeted affect. | en |
dc.format.extent | 299 | en |
dc.format.extent | 303 | en |
dc.language.iso | en | en |
dc.rights | Y | en |
dc.subject | voice quality, voice source, affect, emotion, speech synthesis, emotion, Irish | en |
dc.title | Voice transforms for affect control in Irish speech synthesis | en |
dc.title.alternative | Speech Prosody 2024 | en |
dc.type | Conference Paper | en |
dc.type.supercollection | scholarly_publications | en |
dc.type.supercollection | refereed_publications | en |
dc.identifier.peoplefinderurl | http://people.tcd.ie/cegobl | |
dc.identifier.rssinternalid | 274737 | |
dc.identifier.doi | https://doi.org/10.21437/SpeechProsody.2024-61 | |
dc.rights.ecaccessrights | openAccess | |
dc.subject.TCDTag | Emotion in Speech | en |
dc.subject.TCDTag | Voice quality | en |
dc.identifier.orcid_id | 0000-0002-5958-3891 | |
dc.status.accessible | N | en |
dc.contributor.sponsor | Irish Research Council (IRC) | en |
dc.contributor.sponsorGrantNumber | GOIPG/2021/561 | en |
dc.contributor.sponsor | Government of Ireland | en |
dc.identifier.uri | https://hdl.handle.net/2262/110904 | |