DocumentCode
2387368
Title
A Maximum Entropy Markov Model for Prediction of Prosodic Phrase Boundaries in Chinese TTS
Author
Zhao, Ziping ; Zhao, Tingjian ; Zhu, Yaoting
Author_Institution
Nankai Univ., Tianjin
fYear
2007
fDate
2-4 Nov. 2007
Firstpage
498
Lastpage
498
Abstract
Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper a method based on maximum entropy Markov model (MEMM) is proposed to predict prosodic phrase boundaries in unrestricted Chinese text. MEMM is described in detail that combines transition probabilities and conditional probabilities of states effectively. The conditional probabilities of states are estimated by maximum entropy (ME) theory. A comparison is conducted between the new model and maximum entropy model for prosody phrase break prediction. The experiments show that utilizing the same feature set, MEMM improves overall performance. The precision and recall ratio are improved.
Keywords
Markov processes; maximum entropy methods; natural language processing; probability; speech synthesis; Mandarin Chinese speech flow; conditional probability; hierarchical prosody structure generation; maximum entropy Markov model; prosodic phrase boundary prediction; text-to-speech synthesis system; transition probability; Educational institutions; Entropy; Geographic Information Systems; Hidden Markov models; Predictive models; Probability; Speech synthesis; State estimation; Statistical analysis; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3032-1
Type
conf
DOI
10.1109/GrC.2007.66
Filename
4403149
Link To Document