DocumentCode
396663
Title
An efficient learning algorithm with second-order convergence for multilayer neural networks
Author
Ninomiya, Hiroshi ; Tomita, Chikahiro ; Asai, Hideki
Author_Institution
Dept. of Inf. Sci., Shonan Inst. of Technol., Fujisawa, Japan
Volume
3
fYear
2003
fDate
20-24 July 2003
Firstpage
2028
Abstract
This paper describes an efficient second-order algorithm for learning of the multilayer neural networks with widely and stable convergent properties. First, the algorithm based on iterative formula of the steepest descent method, which is "implicitly" employed, is introduced. We show the equivalent property between the Gauss-Newton(GN) method and the "implicit" steepest descent (ISD) method. This means that ISD method satisfy the desired targets by simultaneously combining the merits of the GN and SD techniques in order to enhance the very good properties of SD method. Next, we propose very powerful algorithm for learning multilayer feedforward neural networks, called "implicit" steepest descent with momentum (ISDM) method and show the analogy with the trapezoidal formula in the field of numerical analysis. Finally, the proposed algorithms are compared with GN method for training multilayer neural networks through the computer simulations.
Keywords
Newton method; convergence of numerical methods; feedforward neural nets; iterative methods; learning (artificial intelligence); GN method; Gauss-Newton method; ISD method; ISDM method; computer simulation; implicit steepest descent method; implicit steepest descent with momentum method; iterative formula; learning algorithm; multilayer feedforward neural networks; numerical analysis; second-order algorithm; second-order convergence; training multilayer neural networks; trapezoidal formula; Artificial neural networks; Convergence; Gaussian processes; Iterative algorithms; Least squares methods; Multi-layer neural network; Neural networks; Newton method; Recursive estimation; Systems engineering and theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2003. Proceedings of the International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-7898-9
Type
conf
DOI
10.1109/IJCNN.2003.1223719
Filename
1223719
Link To Document