DocumentCode :
3123741
Title :
Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis
Author :
Xin Wang ; Zhen-Hua Ling ; Li-Rong Dai
Author_Institution :
iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
84
Lastpage :
87
Abstract :
In our previous work, we have presented a cross-stream dependency modeling method for hidden Markov model (HMM) based parametric speech synthesis. In this method, multi-space probability distribution (MSD) was adopted for F0 modeling and the voicing decision error influenced the accuracy of generated spectral features severely. Therefore, a cross-stream dependency modeling method using continuous F0 HMM (CF-HMM) is proposed in this paper to circumvent voicing decision during the generation of spectral features. Besides, in order to prevent over-fitting problem in model training, regression class is introduced to tie the transform matrices in dependency models. Experiments on proposed methods show both improvement on the accuracy of the generated spectral features and effectiveness of introducing regression class in dependency model training.
Keywords :
hidden Markov models; speech synthesis; statistical distributions; HMM-based speech synthesis; MSD; continuous F0 model; cross-stream dependency modeling; hidden Markov model; multispace probability distribution; parametric speech synthesis; Accuracy; Feature extraction; Hidden Markov models; Speech; Speech synthesis; Training; Transforms; continuous F0 model; cross-stream dependency; hidden Markov model; regression class; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423457
Filename :
6423457
Link To Document :
بازگشت