• DocumentCode
    25280
  • Title

    Harmonics Plus Noise Model Based Vocoder for Statistical Parametric Speech Synthesis

  • Author

    Erro, Daniel ; Sainz, I. ; Navas, Eva ; Hernaez, Inma

  • Author_Institution
    AhoLab Signal Process. Lab., Univ. of the Basque Country (UPV/EHU), Bilbao, Spain
  • Volume
    8
  • Issue
    2
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    184
  • Lastpage
    194
  • Abstract
    This article explores the potential of the harmonics plus noise model of speech in the development of a high-quality vocoder applicable in statistical frameworks, particularly in modern speech synthesizers. It presents an extensive explanation of all the different alternatives considered during the design of the HNM-based vocoder, together with the corresponding objective and subjective experiments, and a careful description of its implementation details. Three aspects of the analysis have been investigated: refinement of the pitch estimation using quasi-harmonic analysis, study and comparison of several spectral envelope analysis procedures, and strategies to analyze and model the maximum voiced frequency. The performance of the resulting vocoder is shown to be similar to that of state-of-the-art vocoders in synthesis tasks.
  • Keywords
    harmonic analysis; speech synthesis; statistical analysis; vocoders; HNM-based vocoder; harmonics plus noise model; maximum voiced frequency; objective experiments; pitch estimation; quasi-harmonic analysis; spectral envelope analysis procedures; statistical parametric speech synthesis; subjective experiments; Cepstral analysis; Estimation; Harmonic analysis; Interpolation; Noise; Speech; Vocoders; Harmonics plus noise model; statistical parametric speech synthesis; vocoder; voice transformation;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2013.2283471
  • Filename
    6609074