• DocumentCode
    3527993
  • Title

    Trajectory training considering global variance for HMM-based speech synthesis

  • Author

    Toda, Tomoki ; Young, Steve

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol. (NAIST), Nara
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4025
  • Lastpage
    4028
  • Abstract
    This paper presents a novel method for training hidden Markov models (HMMs) for use in HMM-based speech synthesis. The primary goal of HMM parameter optimization is to ensure that parameters generated from the trained models exhibit similar properties to natural speech. In this paper, two major problems in conventional training are addressed: 1) the inconsistency between the training and synthesis optimization criterion; and 2) the over-smoothing caused by the statistical modeling process. The proposed method integrates the global variance (GV) criterion into a trajectory training method to give a unified framework for both training and synthesis which provides both a consistent optimization criterion and a closed form solution for parameter generation. The experimental results demonstrate that the proposed method yields a significant improvement in the naturalness of synthetic speech.
  • Keywords
    hidden Markov models; speech synthesis; HMM-based speech synthesis; global variance criterion; hidden Markov models; synthesis optimization criterion; trajectory training method; Acoustics; Constraint optimization; Hidden Markov models; Information science; Natural languages; Optimization methods; Signal synthesis; Smoothing methods; Speech processing; Speech synthesis; global variance; hidden Markov models; speech synthesis; training criterion; trajectory likelihood;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960511
  • Filename
    4960511