DocumentCode
508227
Title
Chinese Prosody Structure Prediction Based on Conditional Random Fields
Author
Sun, Jingwei ; Yang, Jing ; Zhang, Jianping ; Yan, Yonghong
Author_Institution
Thinkit Speech Lab., Chinese Acad. of Sci., Beijing, China
Volume
3
fYear
2009
fDate
14-16 Aug. 2009
Firstpage
602
Lastpage
606
Abstract
In this paper, a novel statistical method based on conditional random fields (CRF) is proposed for hierarchical prosody structure prediction, which is a key module in speech synthesis systems. We will discuss how to build the prosody models for mandarin Chinese using conditional random fields in detail, including corpus preparation, feature selection, feature template design, model training and evaluation. Comparison is conducted between the new method and the classical decision tree based one. The experimental results show that CRF-based method can significantly improve the overall performance with the same feature set.
Keywords
decision trees; speech synthesis; Chinese prosody structure prediction; conditional random fields; corpus preparation; decision tree; feature selection; feature template design; speech synthesis systems; statistical method; Acoustics; Decision trees; Entropy; Hidden Markov models; Labeling; Laboratories; Predictive models; Speech synthesis; Statistical analysis; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location
Tianjin
Print_ISBN
978-0-7695-3736-8
Type
conf
DOI
10.1109/ICNC.2009.44
Filename
5366089
Link To Document