Title :
A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin
Author :
Liu, Fangzhou ; Jia, Huibin ; Tao, Jianhua
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
Abstract :
Modeling prosodic rhythm is of great importance for both speech synthesis and speech understanding, and it requires a large enough corpus with precise prosodic boundary labels. This paper proposes a maximum entropy (ME) based hierarchical model, which utilizes both text and acoustic features, to automatically label Mandarin prosodic boundaries. Results of comparative experiments show that, for the task of prosodic boundary detection, ME model obviously outperforms classification and regression tree (CART), and the bottom-up hierarchical framework is also significantly superior to the flat single-level framework.
Keywords :
acoustic signal processing; maximum entropy methods; natural languages; speech synthesis; Mandarin; acoustic feature; automatic prosodic boundary detection; maximum entropy based hierarchical model; prosodic rhythm; speech synthesis; text feature; Classification tree analysis; Entropy; Hidden Markov models; Labeling; Laboratories; Pattern recognition; Predictive models; Regression tree analysis; Rhythm; Speech synthesis;
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
DOI :
10.1109/CHINSL.2008.ECP.76