• DocumentCode
    2175484
  • Title

    HNM-based MFCC+F0 extractor applied to statistical speech synthesis

  • Author

    Erro, Daniel ; Sainz, Iñaki ; Navas, Eva ; Hernáez, Inma

  • Author_Institution
    AHOLAB Signal Process. Lab., Univ. of the Basque Country, Bilbao, Spain
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4728
  • Lastpage
    4731
  • Abstract
    Currently, the statistical framework based on Hidden Markov Models (HMMs) plays a relevant role in speech synthesis, while voice conversion systems based on Gaussian Mixture Models (GMMs) are almost standard. In both cases, statistical modeling is applied to learn distributions of acoustic vectors extracted from speech signals, each vector containing a suitable parametric representation of one speech frame. The overall performance of the systems is often limited by the accuracy of the underlying speech parameterization and reconstruction method. The method presented in this paper allows accurate MFCC extraction and high-quality reconstruction of speech signals assuming a Harmonics plus Noise Model (HNM). Its suitability for high-quality HMM-based speech synthesis is shown through subjective tests.
  • Keywords
    hidden Markov models; speech synthesis; Gaussian mixture models; HNM-based MFCC+FO extractor; acoustic vectors; harmonic plus noise model; hidden Markov models; speech parameterization method; speech reconstruction method; speech signals; statistical modeling; statistical speech synthesis; Cepstral analysis; Harmonic analysis; Hidden Markov models; Noise; Power harmonic filters; Speech; Speech synthesis; harmonics plus noise model; speech parameterization; statistical parametric speech synthesis; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947411
  • Filename
    5947411