Title :
Noise-robust automatic speech recognition using a discriminative echo state network
Author :
Skowronski, Mark D. ; Harris, John G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Florida Univ., Gainesville, FL
Abstract :
The echo state network (ESN) is a recurrent neural network proposed by Herbert Jaeger with a simplified training routine. Previously, we have demonstrated the noise-robust performance of the predictive ESN classifier in automatic speech recognition experiments in which the network was trained to predict the next frame of speech features. Classification performance was limited because the predictive models lacked discriminability, so we changed the model output to a one-of-many output encoding scheme and trained the model discriminatively. Performance was compared to a hidden Markov model (HMM) in small-vocabulary ASR experiments with additive noise. Accuracy of 50% was achieved by a discriminative ESN classifier at -8.5 dB SNR, compared to 0.4 dB SNR for a predictive ESN classifier and 6.6 dB SNR for an HMM. With discriminative training, a larger reservoir was employed for the discriminative ESN classifier compared to the predictive ESN classifier which resulted a larger memory depth and more noise-robust performance.
Keywords :
hidden Markov models; learning (artificial intelligence); recurrent neural nets; speech recognition; additive noise; classification performance; discriminative echo state network; discriminative training; hidden Markov model; noise-robust automatic speech recognition; one-of-many output encoding scheme; predictive models; recurrent neural network; small-vocabulary ASR experiments; Acoustic noise; Additive noise; Automatic speech recognition; Finite impulse response filter; Hidden Markov models; Noise robustness; Predictive models; Reservoirs; Signal to noise ratio; Testing;
Conference_Titel :
Circuits and Systems, 2007. ISCAS 2007. IEEE International Symposium on
Conference_Location :
New Orleans, LA
Print_ISBN :
1-4244-0920-9
Electronic_ISBN :
1-4244-0921-7
DOI :
10.1109/ISCAS.2007.378015