Title :
Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition
Author :
Schlüter, Ralf ; Bezrukov, L. ; Wagner, Hannes ; Ney, Hermann
Author_Institution :
Dept. of Comput. Sci., RWTH Aachen Univ., Germany
Abstract :
In this work, an acoustic feature set based on a gammatone filterbank is introduced for large vocabulary speech recognition. The gammatone features presented here lead to competitive results on the EPPS English task, and considerable improvements were obtained by subsequently combining them with a number of standard acoustic features, namely MFCC, PLP, MF-PLP, and VTLN plus voicedness. The best results were obtained when combining gammatone features with all other features using weighted ROVER, resulting in a relative improvement of about 12% in word error rate compared to the best single-feature system. We also found that ROVER gives better results for feature combination than both log-linear model combination and LDA.
Keywords :
channel bank filters; feature extraction; speech processing; speech recognition; EPPS English; acoustic feature; feature combination; gammatone features; gammatone filterbank; large vocabulary speech recognition; word error rate; Biology; Cepstral analysis; Computer science; Filter bank; Frequency; Humans; IIR filters; Vocabulary; acoustic feature combination; auditory systems
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366996