DocumentCode
2798853
Title
Auditory model based modified MFCC features
Author
Chatterjee, Saikat ; Kleijn, W. Bastiaan
Author_Institution
ACCESS Linnaeus Center, KTH Royal Institute of Technology, Stockholm, Sweden
fYear
2010
fDate
14-19 March 2010
Firstpage
4590
Lastpage
4593
Abstract
Using spectral and spectro-temporal auditory models, we develop a computationally simple feature vector based on the design architecture of existing mel-frequency cepstral coefficients (MFCCs). In addition to an optimized static function that compresses a set of filter bank energies, we propose a memory-based adaptive compression function that incorporates the behavior of the human auditory response across time and frequency. We show that a significant improvement in automatic speech recognition (ASR) performance is obtained in both clean and noisy environmental conditions.
Keywords
data compression; hearing; physiological models; speech processing; speech recognition; ASR performance; automatic speech recognition; feature vector; filter bank energy; human auditory response; mel frequency cepstral coefficients; memory based adaptive compression function; modified MFCC features; optimized static function; spectral auditory model; spectro-temporal auditory model; Acceleration; Auditory system; Automatic speech recognition; Computational complexity; Filter bank; Humans; Mel frequency cepstral coefficient; Psychoacoustic models; Signal processing; Time factors; ASR; MFCC; auditory model;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495557
Filename
5495557