DocumentCode :
1652213
Title :
Emphasized speech synthesis based on hidden Markov models
Author :
Morizane, Kumiko ; Nakamura, Keigo ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
fYear :
2009
Firstpage :
76
Lastpage :
81
Abstract :
This paper presents a statistical approach to synthesizing emphasized speech based on hidden Markov models (HMMs). Context-dependent HMMs are trained using emphasized speech data uttered by intentionally emphasizing an arbitrary accentual phrase in a sentence. To model acoustic characteristics of emphasized speech, new contextual factors describing an emphasized accentual phrase are additionally considered in model training. Moreover, to build HMMs for synthesizing both normal speech and emphasized speech, we investigate two training methods; one is training of individual models for normal and emphasized speech using each of these two types of speech data separately; and the other is training of a mixed model using both of them simultaneously. The experimental results demonstrate that 1) HMM-based speech synthesis is effective for synthesizing emphasized speech and 2) the mixed model allows a more compact HMM set generating more naturally sounding but slightly less emphasized speech compared with the individual models.
Keywords :
hidden Markov models; speech synthesis; context-dependent HMM; hidden Markov models; speech synthesis; Communication system control; Context modeling; Control system synthesis; Databases; Hidden Markov models; Loudspeakers; Navigation; Signal generators; Speech synthesis; User interfaces;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments, 2009 Oriental COCOSDA International Conference on
Conference_Location :
Urumqi
Print_ISBN :
978-1-4244-4400-7
Electronic_ISBN :
978-1-4244-4400-7
Type :
conf
DOI :
10.1109/ICSDA.2009.5278371
Filename :
5278371
Link To Document :
بازگشت