Title :
Tikhonov regularization for deep neural network acoustic modeling
Author :
Jen-Tzung Chien ; Tsai-Wei Lu
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Abstract :
Deep neural network (DNN) has been widely demonstrated to achieve high performance in different speech recognition tasks. This paper focuses on the issue of model regularization in DNN acoustic model. Our idea is to compensate for the perturbations over training samples in the restricted Boltzmann machine (RBM) which is applied as a pre-training stage for unsupervised feature learning and structural modeling. We introduce the Tikhonov regularization in pre-training procedure and pursue the invariance property of objective function over the variations in input samples. This Tikhonov regularization is further combined with the regularization based on weight decay. The error function in supervised cross-entropy training is accordingly reduced. Experimental results on using RM and Aurora4 tasks show that hybrid regularization in RBM pre-training improves the training condition in DNN acoustic model and the robustness in speech recognition performance.
Keywords :
Boltzmann machines; speech recognition; unsupervised learning; DNN acoustic model; RBM; Tikhonov model regularization; deep neural network acoustic modeling; pre-training procedure; restricted Boltzmann machine; speech recognition tasks; structural modeling; supervised cross-entropy training; unsupervised feature learning; Acoustics; Computational modeling; Data models; Hidden Markov models; Speech; Speech recognition; Training; Tikhonov regularization; acoustic model; deep neural network; speech recognition;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078565