• DocumentCode
    3412428
  • Title

    Control of fundamental frequency contours using the generation process model in HMM-based speech synthesis

  • Author

    Matsuda, Tetsuya ; Hirose, Keikichi ; Minematsu, Nobuaki

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2010
  • fDate
    24-28 Oct. 2010
  • Firstpage
    617
  • Lastpage
    620
  • Abstract
    A method is proposed to increase the naturalness of prosody generated by speech synthesis based on hidden Markov models (HMMs). The method adds a constraint to the fundamental frequency contours (F0 contours) during HMM-based speech synthesis. The constraint adopted is the generation process model of F0 contours (F0 model). The method first extracts the F0 model parameters from the original F0 contour (generated by the HMM-based speech synthesis) and then optimizes them successively by referring to the pre-trained HMMs. The experimental results show that the proposed method can improve naturalness when the F0 control by the original method is inadequate.
  • Keywords
    hidden Markov models; speech synthesis; F0 control; fundamental frequency contours; generation process model; Context modeling; Optimization; Smoothing methods; Speech; F0 contour; HMM-based speech synthesis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing (ICSP), 2010 IEEE 10th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-5897-4
  • Type

    conf

  • DOI
    10.1109/ICOSP.2010.5656358
  • Filename
    5656358