DocumentCode
2066517
Title
A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin
Author
Liu, Fangzhou ; Jia, Huibin ; Tao, Jianhua
Author_Institution
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
Modeling prosodic rhythm is of great importance for both speech synthesis and speech understanding, and it requires a large enough corpus with precise prosodic boundary labels. This paper proposes a maximum entropy (ME) based hierarchical model, which utilizes both text and acoustic features, to automatically label Mandarin prosodic boundaries. Results of comparative experiments show that, for the task of prosodic boundary detection, ME model obviously outperforms classification and regression tree (CART), and the bottom-up hierarchical framework is also significantly superior to the flat single-level framework.
Keywords
acoustic signal processing; maximum entropy methods; natural languages; speech synthesis; Mandarin; acoustic feature; automatic prosodic boundary detection; maximum entropy based hierarchical model; prosodic rhythm; speech synthesis; text feature; Classification tree analysis; Entropy; Hidden Markov models; Labeling; Laboratories; Pattern recognition; Predictive models; Regression tree analysis; Rhythm; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.76
Filename
4730330
Link To Document