• DocumentCode
    2387368
  • Title

    A Maximum Entropy Markov Model for Prediction of Prosodic Phrase Boundaries in Chinese TTS

  • Author

    Zhao, Ziping ; Zhao, Tingjian ; Zhu, Yaoting

  • Author_Institution
    Nankai Univ., Tianjin
  • fYear
    2007
  • fDate
    2-4 Nov. 2007
  • Firstpage
    498
  • Lastpage
    498
  • Abstract
    Hierarchical prosody structure generation is a key component for a speech synthesis system. One major feature of the prosody of Mandarin Chinese speech flow is prosodic phrase grouping. In this paper a method based on maximum entropy Markov model (MEMM) is proposed to predict prosodic phrase boundaries in unrestricted Chinese text. MEMM is described in detail that combines transition probabilities and conditional probabilities of states effectively. The conditional probabilities of states are estimated by maximum entropy (ME) theory. A comparison is conducted between the new model and maximum entropy model for prosody phrase break prediction. The experiments show that utilizing the same feature set, MEMM improves overall performance. The precision and recall ratio are improved.
  • Keywords
    Markov processes; maximum entropy methods; natural language processing; probability; speech synthesis; Mandarin Chinese speech flow; conditional probability; hierarchical prosody structure generation; maximum entropy Markov model; prosodic phrase boundary prediction; text-to-speech synthesis system; transition probability; Educational institutions; Entropy; Geographic Information Systems; Hidden Markov models; Predictive models; Probability; Speech synthesis; State estimation; Statistical analysis; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2007. GRC 2007. IEEE International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3032-1
  • Type

    conf

  • DOI
    10.1109/GrC.2007.66
  • Filename
    4403149