DocumentCode
730712
Title
Deep recurrent regularization neural network for speech recognition
Author
Jen-Tzung Chien ; Tsai-Wei Lu
Author_Institution
Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear
2015
fDate
19-24 April 2015
Firstpage
4560
Lastpage
4564
Abstract
This paper presents a deep recurrent regularization neural network (DRRNN) for speech recognition. Our idea is to build a regularization neural network acoustic model by conducting the hybrid Tikhonov and weight-decay regularization which compensates the variations due to the input speech as well as the model parameters in the restricted Boltzmann machine as a pre-training stage for feature learning and structural modeling. In addition, a new backpropagation through time (BPTT) algorithm is developed by extending the truncated minibatch training for recurrent neural network where the minibatch BPTT is not only performed in recurrent layer but also in feedforward layer. The DRRNN acoustic model is accordingly established to capture the temporal correlation in a regularization neural network. Experimental results on the tasks of RM and Aurora4 show the effectiveness and robustness of using DRRNN for speech recognition.
Keywords
Boltzmann machines; speech recognition; BPTT algorithm; DRRNN acoustic model; backpropagation through time algorithm; deep recurrent regularization neural network acoustic model; hybrid Tikhonov and weight-decay regularization; restricted Boltzmann machine; speech recognition; truncated minibatch training; Acoustics; Hidden Markov models; Neurons; Recurrent neural networks; Speech; Training; Recurrent neural network; acoustic model; deep learning; model regularization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178834
Filename
7178834
Link To Document