• DocumentCode
    1786592
  • Title

    A theoretically consistent method for minimum mean-square error estimation of mel-frequency cepstral features

  • Author

    Jensen, Jens ; Zheng-Hua Tan

  • Author_Institution
    Oticon A/S, Smorum, Denmark
  • fYear
    2014
  • fDate
    19-21 Sept. 2014
  • Firstpage
    368
  • Lastpage
    373
  • Abstract
    We propose a method for minimum mean-square error (MMSE) estimation of mel-frequency cepstral features for noise robust automatic speech recognition (ASR). The method is based on a minimum number of well-established statistical assumptions; no assumptions are made which are inconsistent with others. The strength of the proposed method is that it allows MMSE estimation of mel-frequency cepstral coefficients (MFCC´s), cepstral mean-subtracted MFCC´s (CMS-MFCC´s), velocity, and acceleration coefficients. Furthermore, the method is easily modified to take into account other compressive non-linearities than the logarithmic which is usually used for MFCC computation. The proposed method shows estimation performance which is identical to or better than state-of-the-art methods. It further shows comparable ASR performance, where the advantage of being able to use mel-frequency speech features based on a power non-linearity rather than a logarithmic is demonstrated.
  • Keywords
    cepstral analysis; mean square error methods; speech recognition; statistical analysis; ASR; MMSE estimation; acceleration coefficients; cepstral mean-subtracted MFCC; compressive nonlinearities; consistent method; mel-frequency cepstral features; minimum mean square error estimation; noise robust automatic speech recognition; power nonlinearity; statistical assumptions; Estimation; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Speech recognition; MMSE estimation; cepstral feaures; feature enhancement; power vs. logarithmic non-linearity; robust automatic speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Infrastructure and Digital Content (IC-NIDC), 2014 4th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-4736-2
  • Type

    conf

  • DOI
    10.1109/ICNIDC.2014.7000327
  • Filename
    7000327