• DocumentCode
    2017229
  • Title

    Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM

  • Author

    Yang, Chen-Yu ; Ling, Zhen-Hua ; Lu, Heng ; Guo, Wu ; Dai, Li-Rong

  • Author_Institution
    iFly Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    374
  • Lastpage
    377
  • Abstract
    In this paper, an automatic prosodic phrase boundary labeling method for speech synthesis database is presented. This method can be divided into two stages: training stage and labeling stage. In training stage, context-dependent HMM, which is commonly adopted in the HMM-based parametric speech synthesis, is estimated using the training database with manual prosodic labeling. In labeling stage, the maximum likelihood criterion derived from the trained HMMs and the exhaustive search method are employed to find the optimal phrase boundary positions for an input sentence based on its acoustic features. The experimental results show that an F-score of 76.46% can be achieved for the prosodic phrase boundary detection of our Mandarin TTS corpus, which is close to the accuracy of experienced human labelers.
  • Keywords
    hidden Markov models; speech synthesis; Mandarin TTS corpus; acoustic features; automatic phrase boundary labeling; context dependent HMM; hidden Markov model; manual prosodic labeling; maximum likelihood criterion; speech synthesis database; Acoustics; Context; Hidden Markov models; Labeling; Speech; Speech synthesis; Training; automatic labeling; hidden Markov model; prosodic phrase boundary; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684864
  • Filename
    5684864