DocumentCode
2703906
Title
A Speaker Adaptation Technique for MRHSMM-Based Style Control of Synthetic Speech
Author
Nose, Takashi ; Kato, Yoichi ; Kobayashi, Takao
Author_Institution
Interdisciplinary Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM), In the MRHSMM-based style control technique, when available training data is very small, the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.
Keywords
hidden Markov models; speaker recognition; speech synthesis; MRHSMM; linear transformation framework; model adaptation technique; multiple regression hidden semiMarkov model; speaker adaptation technique; speech recognition; speech synthesis; style control technique; style synthetic speech; transformation matrices; unnatural sounding speech; Adaptation model; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Nose; Speech recognition; Speech synthesis; Testing; Training data; Expressive speech synthesis; Hidden Markov model; MLLR; Speaker adaptation; Style control;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2007.367042
Filename
4218230
Link To Document