مرکز منطقه ای اطلاع رساني علوم و فناوري - Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis

DocumentCode :

3123741

Title :

Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis

Author :

Xin Wang ; Zhen-Hua Ling ; Li-Rong Dai

Author_Institution :

iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2012

fDate :

5-8 Dec. 2012

Firstpage :

Lastpage :

Abstract :

In our previous work, we have presented a cross-stream dependency modeling method for hidden Markov model (HMM) based parametric speech synthesis. In this method, multi-space probability distribution (MSD) was adopted for F0 modeling and the voicing decision error influenced the accuracy of generated spectral features severely. Therefore, a cross-stream dependency modeling method using continuous F0 HMM (CF-HMM) is proposed in this paper to circumvent voicing decision during the generation of spectral features. Besides, in order to prevent over-fitting problem in model training, regression class is introduced to tie the transform matrices in dependency models. Experiments on proposed methods show both improvement on the accuracy of the generated spectral features and effectiveness of introducing regression class in dependency model training.

Keywords :

hidden Markov models; speech synthesis; statistical distributions; HMM-based speech synthesis; MSD; continuous F0 model; cross-stream dependency modeling; hidden Markov model; multispace probability distribution; parametric speech synthesis; Accuracy; Feature extraction; Hidden Markov models; Speech; Speech synthesis; Training; Transforms; continuous F0 model; cross-stream dependency; hidden Markov model; regression class; speech synthesis;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on

Conference_Location :

Kowloon

Print_ISBN :

978-1-4673-2506-6

Electronic_ISBN :

978-1-4673-2505-9

Type :

conf

DOI :

10.1109/ISCSLP.2012.6423457

Filename :

6423457

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3123741