DocumentCode :
2705710
Title :
Model Adaptation Approach to Speech Synthesis with Diverse Voices and Styles
Author :
Yamagishi, Junichi ; Kobayashi, Takehiko ; Tachibana, Miyako ; Ogata, Kohichi ; Nakano, Yoshiaki
Author_Institution :
Interdisciplinary Graduate Sch. of Sci. & Eng., Tokyo Inst. of Technol., Japan
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
In human computer interaction and dialogue systems, it is often desirable for text-to-speech synthesis to be able to generate natural sounding speech with an arbitrary speaker´s voice and with varying speaking styles and/or emotional expressions. We have developed an average-voice-based speech synthesis method using statistical average voice models and model adaptation techniques for this purpose. In this paper, we describe an overview of the speech synthesis system and show the current performance with several experimental results.
Keywords :
human computer interaction; speech synthesis; statistical analysis; average-voice-based speech synthesis method; dialogue systems; diverse voices; emotional expressions; human computer interaction; model adaptation approach; natural sounding speech; statistical average voice models; text-to-speech synthesis; varying speaking styles; Acoustical engineering; Adaptation model; Clustering algorithms; Context modeling; Frequency estimation; Hidden Markov models; Loudspeakers; Speech synthesis; Synthesizers; Training data; HMM; average voice; speaker adaptation; speech synthesis; voice conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367299
Filename :
4218330
Link To Document :
بازگشت