DocumentCode :
2798945
Title :
Cross-validation based decision tree clustering for HMM-based TTS
Author :
Zhang, Yu ; Yan, Zhi-Jie ; Soong, Frank K.
Author_Institution :
Microsoft Res. Asia, Beijing, China
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4602
Lastpage :
4605
Abstract :
In HMM-based speech synthesis, we usually use complex, context dependent models to characterize prosodically and linguistically rich speech units. It is therefore difficult to prepare training data which can cover all combinatorial possibilities of contexts. A common approach to cope with this insufficient training data problem is to build a clustered tree via the MDL criterion. However, an MDL-based tree still tends to be inadequate in its power to predict unseen data. In this paper, we adopt the cross-validation principle to build such a decision tree to minimize the generation error of unseen contexts. An efficient training algorithm is implemented by exploiting the sufficient statistics. Experimental results show that the proposed method can achieve better speech synthesis results, both objectively and subjectively, than the baseline results of the MDL-based decision tree.
Keywords :
decision trees; hidden Markov models; pattern clustering; speech synthesis; statistical analysis; MDL criterion; contexts; cross-validation; decision tree clustering; generation error; hmm-based TTS; linguistically rich speech units; speech synthesis; statistics; training algorithm; Asia; Clustering algorithms; Context modeling; Decision trees; Hidden Markov models; Predictive models; Speech synthesis; Statistics; Stress; Training data; HMM-based speech synthesis; MDL; context clustering; cross validation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495560
Filename :
5495560
Link To Document :
بازگشت