DocumentCode :
1693112
Title :
Contextual partial additive structure for HMM-based speech synthesis
Author :
Takaki, Shinji ; Nankaku, Yoshihiko ; Tokuda, Keiichi
Author_Institution :
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya, Japan
fYear :
2013
Firstpage :
7878
Lastpage :
7882
Abstract :
This paper proposes a spectral modeling technique based on a contextual partial additive structure for HMM-based speech synthesis. To represent complicated context dependencies, contextual additive structure models assume multiple independent components which have different context dependencies to form acoustic features. In additive structure models, there is a constraint that a fixed number of additive components are used for generating acoustic features. However, it is natural to assume that the number of components depends on contexts. In the proposed technique, partial additive components affecting arbitrary contextual sub-spaces are created on demand to increase the likelihood. Then, the number of components for each context can be automatically determined with the training data. Experimental results show that the proposed technique outperformed the standard technique in a subjective test.
Keywords :
acoustic signal processing; hidden Markov models; spectral analysis; speech synthesis; HMM-based speech synthesis; acoustic feature generation; arbitrary contextual subspaces; complicated context dependencies; contextual additive structure models; contextual partial additive structure; partial additive components; spectral modeling technique; Acoustics; Additives; Context; Context modeling; Decision trees; Hidden Markov models; Standards; Context clustering; Contextual additive structure; Decision trees; Distribution convolution; HMM-based speech synthesis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6639198
Filename :
6639198