Title :
An RNN-based channel classification for Mandarin speech recognition over GSM/PSTN transmission environments
Author_Institution :
Advanced Technology Center/CCL, Industrial Technology Research Institute, Hsinchu, Taiwan
Abstract :
This paper is concerned with adopting an RNN (Recurrent Neural Network)-based channel classification technique for improving the robustness of speech recognition over GSM (Global System for Mobile Communication) and PSTN (Public Switched Telephone Network) transmission channels. We apply the RNN-based channel classification to select a most likely HMM from pre-trained HMMs that are trained for each specific channel environment. A broad-class discrimination is incorporated into the RNN-based channel classification by rejecting the disturbed frames of testing speech for improving the performance. By applying the proposed technique we obtained a drop on the average word error rate by about 24% for the recognition of the abbreviated Taiwan stock names over the conventional HMM-based scheme. Experimental results show it is an efficient framework to enhance the robustness across different channel environments.
Keywords :
Databases; Recurrent neural networks; Robustness; Signal to noise ratio; Speech; Speech recognition; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743971