Auditory model based modified MFCC features

Author

Chatterjee, Saikat ; Kleijn, W. Bastiaan

Author_Institution

ACCESS Linnaeus Center, KTH-R. Inst. of Technol., Stockholm, Sweden

fYear

2010

fDate

14-19 March 2010

Firstpage

4590

Lastpage

4593

Abstract

Using spectral and spectro-temporal auditory models, we develop a computationally simple feature vector based on the design architecture of existing mel frequency cepstral coefficients (MFCCs). Along with the use of an optimized static function to compress a set of filter bank energies, we propose to use a memory-based adaptive compression function to incorporate the behavior of human auditory response across time and frequency. We show that a significant improvement in automatic speech recognition (ASR) performance is obtained for any environmental condition, clean as well as noisy.

Keywords

data compression; hearing; physiological models; speech processing; speech recognition; ASR performance; automatic speech recognition; feature vector; filter bank energy; human auditory response; mel frequency cepstral coefficients; memory based adaptive compression function; modified MFCC features; optimized static function; spectral auditory model; spectro-temporal auditory model; Acceleration; Auditory system; Automatic speech recognition; Computational complexity; Filter bank; Humans; Mel frequency cepstral coefficient; Psychoacoustic models; Signal processing; Time factors; ASR; MFCC; auditory model;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location

Dallas, TX

ISSN

1520-6149

Print_ISBN

978-1-4244-4295-9

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2010.5495557

Filename

5495557