Title :
Levenberg-Marquardt algorithm with adaptive momentum for the efficient training of feedforward networks
Author :
Ampazis, N. ; Perantonis, S.J.
Author_Institution :
Inst. of Inf. & Telecommun., Nat. Center for Sci. Res. DEMOKRITOS, Athens, Greece
Abstract :
We present a highly efficient second order algorithm for the training of feedforward neural networks. The algorithm is based on iterations of the form employed in the Levenberg-Marquardt (LM) method for nonlinear least squares problems with the inclusion of an additional adaptive momentum term arising from the formulation of the training task as a constrained optimization problem. Its implementation requires minimal additional computations compared to a standard LM iteration which are compensated, however, from its excellent convergence properties. Simulations of large scale classical neural network benchmarks are presented which reveal the power of the method to obtain solutions in difficult problems whereas other standard second order techniques (including LM) fail to converge
Keywords :
Hessian matrices; Jacobian matrices; conjugate gradient methods; convergence; feedforward neural nets; learning (artificial intelligence); mean square error methods; optimisation; Levenberg-Marquardt algorithm; adaptive momentum; constrained optimization problem; convergence properties; efficient training; second order algorithm; Artificial neural networks; Character generation; Constraint optimization; Cost function; Electronic mail; Feedforward neural networks; Informatics; Jacobian matrices; Least squares methods; Neural networks;
Conference_Titel :
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
Conference_Location :
Como
Print_ISBN :
0-7695-0619-4
DOI :
10.1109/IJCNN.2000.857825