  • DocumentCode
    2798853
  • Title
    Auditory model based modified MFCC features
  • Author
    Chatterjee, Saikat; Kleijn, W. Bastiaan
  • Author_Institution
    ACCESS Linnaeus Center, KTH Royal Institute of Technology, Stockholm, Sweden
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4590
  • Lastpage
    4593
  • Abstract
    Using spectral and spectro-temporal auditory models, we develop a computationally simple feature vector based on the design architecture of existing mel frequency cepstral coefficients (MFCCs). Along with the use of an optimized static function to compress a set of filter bank energies, we propose to use a memory-based adaptive compression function to incorporate the behavior of human auditory response across time and frequency. We show that a significant improvement in automatic speech recognition (ASR) performance is obtained for any environmental condition, clean as well as noisy.
  • Keywords
    data compression; hearing; physiological models; speech processing; speech recognition; ASR performance; automatic speech recognition; feature vector; filter bank energy; human auditory response; mel frequency cepstral coefficients; memory based adaptive compression function; modified MFCC features; optimized static function; spectral auditory model; spectro-temporal auditory model; Acceleration; Auditory system; Automatic speech recognition; Computational complexity; Filter bank; Humans; Mel frequency cepstral coefficient; Psychoacoustic models; Signal processing; Time factors; ASR; MFCC; auditory model
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2010.5495557
  • Filename
    5495557