DocumentCode :
3425182
Title :
Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
Author :
Tachibana, Makoto ; Izawa, Shinsuke ; Nose, Takashi ; Kobayashi, Takao
Author_Institution :
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4633
Lastpage :
4636
Abstract :
We propose a technique for synthesizing speech with desired style expressivity of an arbitrary target speaker´s voice. In an MLLR-based speaker adaptation technique for multiple regression hidden semi-Markov model (MRHSMM), the quality of synthesized speech crucially depends on the initial MRHSMM trained from a certain source speaker´s data and it is not always possible to synthesize natural sounding speech with a given target speaker´s voice. To overcome this problem, we perform simultaneous adaptation of speaker and style from an average voice model. Experimental results show that the proposed technique provides more natural sounding speech than the conventional one with speaker adaptation only.
Keywords :
hidden Markov models; regression analysis; speaker recognition; speech synthesis; MLLR-based speaker adaptation; average voice model; multiple regression hidden semiMarkov model; speaker voice; speech quality; speech style; speech synthesis; Adaptation model; Control system synthesis; Costs; Covariance matrix; Hidden Markov models; Least squares methods; Loudspeakers; Nose; Probability density function; Speech synthesis; average voice model; hidden Markov model; speaker adaptation; style control;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518689
Filename :
4518689
Link To Document :
بازگشت