Title :
Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition
Author :
Sivaram, G.S.V.S. ; Hermansky, Hynek
Author_Institution :
IDIAP Res. Inst., Swiss Fed. Inst. of Technol. at Lausanne, Lausanne, Switzerland
Abstract :
This paper proposes modifications to the Multi-resolution RASTA (MRASTA) feature extraction technique for the automatic speech recognition (ASR). By emulating asymmetries of the temporal receptive field (TRF) profiles of auditory mid-brain neurons, we obtain more than 13% relative improvement in word error rate on OGI-Digits database. Experiments on TIMIT database confirm that proposed modifications are indeed useful.
Keywords :
auditory evoked potentials; brain; feature extraction; speech recognition; MRASTA; OGI-Digits database; TIMIT database; TRF; auditory mid-brain neurons; automatic speech recognition; feature extraction; multiresolution RASTA; temporal receptive fields; word error rate; Databases; Feature extraction; Finite impulse response filters; Neurons; Speech; Time-frequency analysis; Trajectory;
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne