Title :
Training recurrent network with block-diagonal approximated Levenberg-Marquardt algorithm
Author :
Chan, Lai-Wan ; Szeto, Chi-Cheong
Author_Institution :
Comput. Sci. & Eng. Dept., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Abstract :
We propose the block-diagonal matrix to approximate the Hessian matrix in the Levenberg-Marquardt method in the training of neural networks. Two weight updating strategies, namely asynchronous and synchronous updating methods, were investigated. Asynchronous method updates weights of one block at a time while synchronous method updates all weights at the same time. Variations of these two methods, which involves the determination of the parameters μ and λ, are examined
Keywords :
Hessian matrices; approximation theory; learning (artificial intelligence); recurrent neural nets; synchronisation; Hessian matrix; Levenberg-Marquardt algorithm; asynchronous updating; block-diagonal matrix; learning algorithm; recurrent neural network; synchronous updating; Backpropagation; Computer science; Decoding; Difference equations; Differential equations; Feedforward neural networks; Neural networks; Neurons; Recurrent neural networks; Stress;
Conference_Titel :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-5529-6
DOI :
10.1109/IJCNN.1999.832595