• DocumentCode
    508227
  • Title

    Chinese Prosody Structure Prediction Based on Conditional Random Fields

  • Author

    Sun, Jingwei ; Yang, Jing ; Zhang, Jianping ; Yan, Yonghong

  • Author_Institution
    Thinkit Speech Lab., Chinese Acad. of Sci., Beijing, China
  • Volume
    3
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    602
  • Lastpage
    606
  • Abstract
    In this paper, a novel statistical method based on conditional random fields (CRF) is proposed for hierarchical prosody structure prediction, which is a key module in speech synthesis systems. We describe in detail how to build prosody models for Mandarin Chinese using conditional random fields, including corpus preparation, feature selection, feature template design, model training, and evaluation. The new method is compared with the classical decision-tree-based one. The experimental results show that the CRF-based method can significantly improve overall performance with the same feature set.
  • Keywords
    decision trees; speech synthesis; Chinese prosody structure prediction; conditional random fields; corpus preparation; decision tree; feature selection; feature template design; speech synthesis systems; statistical method; Acoustics; Decision trees; Entropy; Hidden Markov models; Labeling; Laboratories; Predictive models; Speech synthesis; Statistical analysis; Sun
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.44
  • Filename
    5366089