DocumentCode
3530367
Title
An auditory-based feature for robust speech recognition
Author
Shao, Yang ; Jin, Zhaozhang ; Wang, DeLiang ; Srinivasan, Soundararajan
Author_Institution
Comput. Sci. & Eng. Dept., Ohio State Univ., Columbus, OH
fYear
2009
fDate
19-24 April 2009
Firstpage
4625
Lastpage
4628
Abstract
A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses. Our evaluations show that the proposed feature performs considerably better than conventional acoustic features. We further demonstrate that integrating the proposed feature with a computational auditory scene analysis system yields promising recognition performance.
Keywords
cepstral analysis; feature extraction; filtering theory; speech recognition; acoustic feature; auditory periphery model; auditory-based feature; cepstral analysis; computational auditory scene analysis system; gammatone filterbank response; gammatone frequency cepstral coefficient; speech recognition; Acoustic noise; Automatic speech recognition; Cepstral analysis; Filter bank; Frequency; Humans; Noise robustness; Performance evaluation; Speech enhancement; Speech recognition; Robust speech recognition; auditory feature; computational auditory scene analysis; gammatone frequency cepstral coefficients;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960661
Filename
4960661
Link To Document