• DocumentCode
    2017174
  • Title

    Prosody phrase boundary prediction with ensemble learning

  • Author

    Yi, Lifu ; Li, Jian ; He, Lei ; Hao, Jie ; Zhao, Rui

  • Author_Institution
    Tencent Inc., Beijing, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    397
  • Lastpage
    400
  • Abstract
    It´s necessary for TTS systems to predict the boundaries of natural prosodic phrases for synthetic speech. However, it´s lengthy and costly to annotate enough prosodic phrases manually for the model training. In this paper, we propose a practical method to extract and predict prosodic phrases with raw speech corpora even without manual prosodic phrase annotations. We extract prosodic phrase automatically from four Mandarin speech corpora that are different in narrators or recording text scripts. To alleviate the effects of such differences, we apply two ensemble learning algorithms, AdaBoost and MultiBoosting with C4.5 as the base classifiers, to predict the phrase boundaries derived from different corpora. Experiments show the two algorithms are efficient in reducing classification error, especially when training data is combined with multiple speech corpora. The best performance is achieved by MultiBoosting, in which the F-measure increases from 44.1% to 54.1%, compared with C4.5.
  • Keywords
    learning (artificial intelligence); natural languages; speech recognition; text analysis; AdaBoost; F-measure; Mandarin speech corpora; TTS system; classification error; ensemble learning; manual prosodic phrase annotation; multiboosting; multiple speech corpora; phrase boundary; prosody phrase boundary prediction; raw speech corpora; synthetic speech; text script record; Prosodic phrase; TTS; boosting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684861
  • Filename
    5684861