• DocumentCode
    417119
  • Title

    Pitch prediction from MFCC vectors for speech reconstruction

  • Author

    Shao, Xu ; Milner, Ben

  • Author_Institution
    Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper proposes a technique for reconstructing an acoustic speech signal solely from a stream of Mel-frequency cepstral coefficients (MFCCs). Previous speech reconstruction methods have required an additional pitch element, but this work proposes two maximum a posteriori (MAP) methods for predicting pitch from the MFCC vectors themselves. The first method is based on a Gaussian mixture model (GMM) while the second scheme utilises the temporal correlation available from a hidden Markov model (HMM) framework. A formal measurement of both frame classification accuracy and RMS pitch error shows that an HMM-based scheme with 5 clusters per state is able to classify correctly over 94% of frames and has an RMS pitch error of 3.1 Hz in comparison to a reference pitch. Informal listening tests and analysis of spectrograms reveals that speech reconstructed solely from the MFCC vectors is almost indistinguishable from that using the reference pitch.
  • Keywords
    Gaussian processes; acoustic correlation; hidden Markov models; maximum likelihood estimation; signal reconstruction; speech processing; Gaussian mixture model; HMM; MAP methods; MFCC; MFCC vectors; Mel-frequency cepstral coefficients; acoustic speech signal; frame classification accuracy; hidden Markov model; maximum a posteriori methods; pitch error; pitch prediction; spectrograms; speech reconstruction; temporal correlation; Data mining; Error correction; Frequency estimation; Hidden Markov models; Mel frequency cepstral coefficient; Speech analysis; Speech codecs; Speech processing; Speech recognition; Speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325931
  • Filename
    1325931