• DocumentCode
    3164484
  • Title

    An F0 modeling technique based on prosodic events for spontaneous speech synthesis

  • Author

    Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takao

  • Author_Institution
    Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4589
  • Lastpage
    4592
  • Abstract
    This paper proposes a technique for effective modeling of F0 contours using prosodic-event-based HMM units for HMM-based spontaneous speech synthesis. The modeling unit corresponds to one of prosodic event segments such as pitch falling by accent and pitch rising by boundary pitch movement (BPM). Since the prosodic events of one phrase are generally less frequent than the changes of phonemes, the proposed unit is expected to reduce the number of model parameters of F0, which leads to robust parameter estimation. The objective and subjective experiments using spontaneous conversational speech data show that the proposed technique can significantly reduce the number of model parameters while keeping the naturalness of the synthetic speech.
  • Keywords
    hidden Markov models; speech synthesis; BPM; F0 modeling technique; accent; boundary pitch movement; hidden Markov model; objective experiments; pitch falling; pitch rising; prosodic-event-based HMM units; robust parameter estimation; spontaneous speech synthesis; subjective experiments; Context; Correlation; Hidden Markov models; Nose; Speech; Speech synthesis; Timing; F0 modeling; HMM-based speech synthesis; Prosodic events; Spontaneous speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288940
  • Filename
    6288940