DocumentCode :
591468
Title :
Annotating conversational speech for corpus-based dialogue speech synthesizer — A first step
Author :
Mori, Hisamichi ; Hitomi, Tadaaki
Author_Institution :
Grad. Sch. of Eng., Utsunomiya Univ., Utsunomiya, Japan
fYear :
2012
fDate :
9-12 Dec. 2012
Firstpage :
135
Lastpage :
140
Abstract :
This paper describes an HMM-based speech synthesis that allows dimensional description of emotion as inputs. A spontaneous dialogue speech corpus that was designed for studying paralinguistic phenomena in expressive social interactions was used to train the models, utilizing its emotional state description as additional contextual factors. In the perceptual experiment, a very high correlation was observed (R ≃ 0.8) between given pleasantness/arousal values and averaged subjective evaluations, which means that the synthesized utterances could successfully convey specified paralinguistic information.
Keywords :
audio databases; hidden Markov models; linguistics; speech synthesis; HMM-based speech synthesis; arousal value; conversational speech annotation; corpus-based dialogue speech synthesizer; emotion dimensional description; emotional state description; expressive social interaction; hidden Markov model; paralinguistic information; paralinguistic phenomenon; pleasantness value; spontaneous dialogue speech corpus; subjective evaluation; Context; Correlation; Databases; Hidden Markov models; Pragmatics; Speech; Speech synthesis; HMM-based speech synthesis; UU Database; spontaneous speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2012 International Conference on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-2811-1
Electronic_ISBN :
978-1-4673-2812-8
Type :
conf
DOI :
10.1109/ICSDA.2012.6422461
Filename :
6422461
Link To Document :
بازگشت