Title :
Convergence Study in Extended Kalman Filter-Based Training of Recurrent Neural Networks
Author :
Wang, Xiaoyu ; Huang, Yong
Author_Institution :
Emerging Pathogens Inst., Univ. of Florida, Gainesville, FL, USA
fDate :
4/1/2011 12:00:00 AM
Abstract :
Recurrent neural network (RNN) has emerged as a promising tool in modeling nonlinear dynamical systems, but the training convergence is still of concern. This paper aims to develop an effective extended Kalman filter-based RNN training approach with a controllable training convergence. The training convergence problem during extended Kalman filter-based RNN training has been proposed and studied by adapting two artificial training noise parameters: the covariance of measurement noise (R) and the covariance of process noise (Q) of Kalman filter. The R and Q adaption laws have been developed using the Lyapunov method and the maximum likelihood method, respectively. The effectiveness of the proposed adaption laws has been tested using a nonlinear dynamical benchmark system and further applied in cutting tool wear modeling. The results show that the R adaption law can effectively avoid the divergence problem and ensure the training convergence, whereas the Q adaption law helps improve the training convergence speed.
Keywords :
Kalman filters; Lyapunov methods; convergence; maximum likelihood estimation; nonlinear filters; recurrent neural nets; Lyapunov method; adaption laws; artificial training noise parameters; cutting tool wear modeling; extended Kalman filter-based training; maximum likelihood method; measurement noise covariance; nonlinear systems; process noise covariance; recurrent neural networks; training convergence speed; Artificial neural networks; Convergence; Kalman filters; Neurons; Noise; Recurrent neural networks; Training; Adaptive training algorithm; extended Kalman filter; recurrent neural network; training convergence; Artificial Intelligence; Computer Simulation; Humans; Learning; Likelihood Functions; Neural Networks (Computer); Nonlinear Dynamics; Recurrence;
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2011.2109737