A Speaker Adaptation Technique for MRHSMM-Based Style Control of Synthetic Speech

Author

Nose, Takashi ; Kato, Yoichi ; Kobayashi, Takao

Author_Institution

Interdisciplinary Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama

Volume

4

fYear

2007

fDate

15-20 April 2007

Abstract

This paper describes a speaker adaptation technique for style control based on multiple regression hidden semi-Markov model (MRHSMM), In the MRHSMM-based style control technique, when available training data is very small, the resultant model would produce unnatural sounding speech. To overcome this problem, we propose a model adaptation technique for MRHSMM, which is similar to the MLLR adaptation technique used in speech recognition and speech synthesis. We formulate the model adaptation problem for MRHSMM based on a linear transformation framework and derive re-estimation formulas for transformation matrices in ML sense. We also describe the results of subjective evaluation tests.

Keywords

hidden Markov models; speaker recognition; speech synthesis; MRHSMM; linear transformation framework; model adaptation technique; multiple regression hidden semiMarkov model; speaker adaptation technique; speech recognition; speech synthesis; style control technique; style synthetic speech; transformation matrices; unnatural sounding speech; Adaptation model; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Nose; Speech recognition; Speech synthesis; Testing; Training data; Expressive speech synthesis; Hidden Markov model; MLLR; Speaker adaptation; Style control;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location

Honolulu, HI

ISSN

1520-6149

Print_ISBN

1-4244-0727-3

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2007.367042

Filename

4218230