An auditory-based feature for robust speech recognition

Author

Shao, Yang ; Jin, Zhaozhang ; Wang, DeLiang ; Srinivasan, Soundararajan

Author_Institution

Comput. Sci. & Eng. Dept., Ohio State Univ., Columbus, OH

fYear

2009

fDate

19-24 April 2009

Firstpage

4625

Lastpage

4628

Abstract

A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses. Our evaluations show that the proposed feature performs considerably better than conventional acoustic features. We further demonstrate that integrating the proposed feature with a computational auditory scene analysis system yields promising recognition performance.

Keywords

cepstral analysis; feature extraction; filtering theory; speech recognition; acoustic feature; auditory periphery model; auditory-based feature; cepstral analysis; computational auditory scene analysis system; gammatone filterbank response; gammatone frequency cepstral coefficient; speech recognition; Acoustic noise; Automatic speech recognition; Cepstral analysis; Filter bank; Frequency; Humans; Noise robustness; Performance evaluation; Speech enhancement; Speech recognition; Robust speech recognition; auditory feature; computational auditory scene analysis; gammatone frequency cepstral coefficients;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location

Taipei

ISSN

1520-6149

Print_ISBN

978-1-4244-2353-8

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2009.4960661

Filename

4960661