• DocumentCode
    177500
  • Title

    A postfilter to modify the modulation spectrum in HMM-based speech synthesis

  • Author

    Takamichi, Shinnosuke ; Toda, Tomoki ; Neubig, Graham ; Sakti, Sakriani ; Nakamura, Satoshi

  • Author_Institution
    Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    290
  • Lastpage
    294
  • Abstract
    In this paper, we propose a postfilter to compensate the modulation spectrum in HMM-based speech synthesis. To alleviate the over-smoothing effect, which is a main cause of quality degradation in HMM-based speech synthesis, it is necessary to consider features that can capture over-smoothing. Global Variance (GV) is one well-known example of such a feature, and the effectiveness of the parameter generation algorithm considering GV has been confirmed. However, the quality gap between natural speech and synthetic speech is still large. In this paper, we introduce the Modulation Spectrum (MS) of the speech parameter trajectory as a new feature to effectively capture the over-smoothing effect, and we propose a postfilter based on the MS. The MS is represented as the power spectrum of the parameter trajectory. The generated speech parameter sequence is filtered to ensure that its MS has a pattern similar to that of natural speech. Experimental results show quality improvements when the proposed methods are applied to spectral and F0 components, compared with conventional methods considering GV.
  • Keywords
    filtering theory; hidden Markov models; speech synthesis; GV; HMM-based speech synthesis; MS; global variance; modulation spectrum; natural speech; over-smoothing effect; parameter generation algorithm; postfilter; quality gap; speech parameter trajectory; synthetic speech; Modulation; Natural languages; Speech; Training; Trajectory; over-smoothing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853604
  • Filename
    6853604
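
The abstract describes the MS as the power spectrum of a parameter trajectory and a postfilter that pulls the MS of generated parameters toward that of natural speech. The following is a minimal sketch of that idea, not the authors' implementation. It assumes (the abstract does not say) that the filter normalizes the log power spectrum per modulation-frequency bin using means and standard deviations estimated beforehand from generated and natural trajectories, and that the phase is left unchanged. The function names, the statistics arguments, and the interpolation weight `k` are all hypothetical.

```python
import numpy as np

def modulation_spectrum(traj, n_fft):
    """Return (log power spectrum, phase) of a 1-D parameter trajectory.

    traj is one feature dimension over time (e.g. one mel-cepstral
    coefficient or log-F0); it is zero-padded to n_fft frames.
    """
    spec = np.fft.rfft(traj, n=n_fft)
    # Small floor avoids log(0) for empty bins (an assumption of this sketch).
    return np.log(np.abs(spec) ** 2 + 1e-12), np.angle(spec)

def ms_postfilter(traj, mu_gen, sd_gen, mu_nat, sd_nat, n_fft, k=1.0):
    """Shift the log-MS of traj from 'generated' toward 'natural' statistics.

    mu_*/sd_* are per-bin statistics of the log-MS (length n_fft // 2 + 1),
    assumed to be estimated in advance. k in [0, 1] sets filter strength;
    k = 0 leaves the trajectory unchanged.
    """
    log_ms, phase = modulation_spectrum(traj, n_fft)
    # Mean/variance normalization of the log-MS, bin by bin.
    target = (log_ms - mu_gen) / sd_gen * sd_nat + mu_nat
    new_log_ms = (1.0 - k) * log_ms + k * target
    # Rebuild the complex spectrum with the original phase and invert.
    amplitude = np.exp(0.5 * new_log_ms)
    filtered = np.fft.irfft(amplitude * np.exp(1j * phase), n=n_fft)
    return filtered[: len(traj)]
```

In this sketch `n_fft` must be at least the trajectory length, and each feature dimension would be filtered independently with its own statistics.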