DocumentCode :
3530367
Title :
An auditory-based feature for robust speech recognition
Author :
Shao, Yang ; Jin, Zhaozhang ; Wang, DeLiang ; Srinivasan, Soundararajan
Author_Institution :
Comput. Sci. & Eng. Dept., Ohio State Univ., Columbus, OH
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4625
Lastpage :
4628
Abstract :
A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses. Our evaluations show that the proposed feature performs considerably better than conventional acoustic features. We further demonstrate that integrating the proposed feature with a computational auditory scene analysis system yields promising recognition performance.
Keywords :
cepstral analysis; feature extraction; filtering theory; speech recognition; acoustic feature; auditory periphery model; auditory-based feature; cepstral analysis; computational auditory scene analysis system; gammatone filterbank response; gammatone frequency cepstral coefficient; speech recognition; Acoustic noise; Automatic speech recognition; Cepstral analysis; Filter bank; Frequency; Humans; Noise robustness; Performance evaluation; Speech enhancement; Speech recognition; Robust speech recognition; auditory feature; computational auditory scene analysis; gammatone frequency cepstral coefficients;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960661
Filename :
4960661
Link To Document :
بازگشت