Title :
Gammatone wavelet Cepstral Coefficients for robust speech recognition
Author :
Adiga, Aniruddha ; Magimai, Mathew ; Seelamantula, Chandra Sekhar
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Sci., Bangalore, India
Abstract :
We develop noise robust features using Gammatone wavelets derived from the popular Gammatone functions. These wavelets incorporate the characteristics of human peripheral auditory systems, in particular the spatially-varying frequency response of the basilar membrane. We refer to the new features as Gammatone Wavelet Cepstral Coefficients (GWCC). The procedure involved in extracting GWCC from a speech signal is similar to that of the conventional Mel-Frequency Cepstral Coefficients (MFCC) technique, with the difference being in the type of filterbank used. We replace the conventional mel filterbank in MFCC with a Gammatone wavelet filterbank, which we construct using Gammatone wavelets. We also explore the effect of Gammatone filterbank based features (Gammatone Cepstral Coefficients (GCC)) for robust speech recognition. On AURORA 2 database, a comparison of GWCCs and GCCs with MFCCs shows that Gammatone based features yield a better recognition performance at low SNRs.
Keywords :
audio databases; speech recognition; wavelet transforms; AURORA 2 database; GWCC; Gammatone functions; Gammatone wavelet cepstral coefficients; MFCC technique; Mel-frequency cepstral coefficients; basilar membrane; human peripheral auditory systems; noise robust features; robust speech recognition; speech signal; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Wavelet transforms; Auditory modeling; Cepstral coefficients; Gammatone wavelets; Speech recognition;
Conference_Titel :
TENCON 2013 - 2013 IEEE Region 10 Conference (31194)
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-2825-5
DOI :
10.1109/TENCON.2013.6718948