DocumentCode :
2705763
Title :
HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion
Author :
Ling, Zhen-Hua ; Wang, Ren-Hua
Author_Institution :
IFlytek Speech Lab., Univ. of Sci. & Technol. of China, Hefei
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper presents a hidden Markov model (HMM) based unit selection method using hierarchical units under statistical criterion. In our previous work we tried to use frame sized speech segments and maximum likelihood criterion to improve the performance of traditional concatenative synthesis system using phone sized units and cost function criterion. In this paper, hierarchical units which consist of phone level units and frame level units are adopted to achieve better balance between the coverage rate of candidate unit and the number of concatenation points during synthesis. Besides, Kullback-Leibler divergence (KLD) between candidate and target phoneme HMMs is introduced as a part of the final criterion for unit selection. The listening result proves that these two approaches can improve the performance of synthetic speech effectively.
Keywords :
hidden Markov models; maximum likelihood estimation; speech synthesis; HMM-based hierarchical unit selection; Kullback-Leibler divergence; concatenative synthesis system; cost function criterion; frame level units; frame sized speech segments; hidden Markov model; likelihood criterion; maximum likelihood criterion; phone sized units; statistical criterion; Context modeling; Cost function; Databases; Diversity reception; Dynamic programming; Flowcharts; Hidden Markov models; Signal synthesis; Speech synthesis; Synthesizers; HMM; KLD; Speech Synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2007.367302
Filename :
4218333
Link To Document :
بازگشت