DocumentCode :
3188354
Title :
Can Non-Linear Readout Nodes Enhance the Performance of Reservoir-Based Speech Recognizers?
Author :
Triefenbach, Fabian ; Martens, Jean-Pierre
Author_Institution :
Dept. of Electron. & Inf. Syst., Ghent Univ., Ghent, Belgium
fYear :
2011
fDate :
12-14 Dec. 2011
Firstpage :
262
Lastpage :
267
Abstract :
It has been shown for some time that a Recurrent Neural Network (RNN) can perform an accurate acoustic-phonetic decoding of a continuous speech stream. However, the error back-propagation through time (EBPTT) training of such a network is often critical (bad local optimum) and very time consuming. These problems hamper the deployment of sufficiently large networks that would be able to outperform state-of-the-art Hidden Markov Models. To overcome this drawback of RNNs, we recently proposed to employ a large pool of recurrently connected non-linear nodes (a so-called reservoir) with fixed weights, and to map the reservoir outputs to meaningful phonemic classes by means of a layer of linear output nodes (called the readout nodes) whose weights form the solution of a set of linear equations. In this paper, we collect experimental evidence that the performance of a reservoir-based system can be enhanced by working with non-linear readout nodes. Although this calls for an iterative training, it boils down to a non-linear regression which seems to be less critical and time consuming than EBPTT.
Keywords :
hidden Markov models; learning (artificial intelligence); recurrent neural nets; speech recognition; training; EBPTT training; RNN; acoustic-phonetic decoding; continuous speech stream; error back-propagation through time; hidden Markov models; iterative training; linear equations; linear output nodes; nonlinear readout nodes; nonlinear regression; performance enhancement; phonemic classes; recurrent neural network; recurrently connected nonlinear nodes; reservoir-based speech recognizers; reservoir-based system; Accuracy; Error analysis; Logistics; Reservoirs; Speech recognition; Training; Vectors; Automatic Speech Recognition; Linear Regression; Logistic Regression; On-line Training; Recurrent Neural Networks; Reservoir Computing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics and Computational Intelligence (ICI), 2011 First International Conference on
Conference_Location :
Bandung
Print_ISBN :
978-1-4673-0091-9
Type :
conf
DOI :
10.1109/ICI.2011.50
Filename :
6141682
Link To Document :
بازگشت