• DocumentCode
    730693
  • Title

    The effect of neural networks in statistical parametric speech synthesis

  • Author

    Hashimoto, Kei ; Oura, Keiichiro ; Nankaku, Yoshihiko ; Tokuda, Keiichi

  • Author_Institution
    Dept. of Sci. & Eng. Simulation, Nagoya Inst. of Technol., Nagoya, Japan
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4455
  • Lastpage
    4459
  • Abstract
    This paper investigates how to use neural networks in statistical parametric speech synthesis. Recently, deep neural networks (DNNs) have been used for statistical parametric speech synthesis. However, the specific way how DNNs should be used in statistical parametric speech synthesis has not been studied thoroughly. A generation process of statistical parametric speech synthesis based on generative models can be divided into several components, and those components can be represented by DNNs. In this paper, the effect of DNNs for each component is investigated by comparing DNNs with generative models. Experimental results show that the use of a DNN as acoustic models is effective and the parameter generation combined with a DNN improves the naturalness of synthesized speech.
  • Keywords
    neural nets; speech synthesis; statistical analysis; acoustic models; deep neural networks; generative models; parameter generation; statistical parametric speech synthesis; Artificial neural networks; Hidden Markov models; Speech; Statistical parametric speech synthesis; deep neural network; hidden Markov model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178813
  • Filename
    7178813