Title :
An efficient learning algorithm with second-order convergence for multilayer neural networks
Author :
Ninomiya, Hiroshi ; Tomita, Chikahiro ; Asai, Hideki
Author_Institution :
Dept. of Inf. Sci., Shonan Inst. of Technol., Fujisawa, Japan
Abstract :
This paper describes an efficient second-order algorithm for learning of the multilayer neural networks with widely and stable convergent properties. First, the algorithm based on iterative formula of the steepest descent method, which is "implicitly" employed, is introduced. We show the equivalent property between the Gauss-Newton(GN) method and the "implicit" steepest descent (ISD) method. This means that ISD method satisfy the desired targets by simultaneously combining the merits of the GN and SD techniques in order to enhance the very good properties of SD method. Next, we propose very powerful algorithm for learning multilayer feedforward neural networks, called "implicit" steepest descent with momentum (ISDM) method and show the analogy with the trapezoidal formula in the field of numerical analysis. Finally, the proposed algorithms are compared with GN method for training multilayer neural networks through the computer simulations.
Keywords :
Newton method; convergence of numerical methods; feedforward neural nets; iterative methods; learning (artificial intelligence); GN method; Gauss-Newton method; ISD method; ISDM method; computer simulation; implicit steepest descent method; implicit steepest descent with momentum method; iterative formula; learning algorithm; multilayer feedforward neural networks; numerical analysis; second-order algorithm; second-order convergence; training multilayer neural networks; trapezoidal formula; Artificial neural networks; Convergence; Gaussian processes; Iterative algorithms; Least squares methods; Multi-layer neural network; Neural networks; Newton method; Recursive estimation; Systems engineering and theory;
Conference_Titel :
Neural Networks, 2003. Proceedings of the International Joint Conference on
Print_ISBN :
0-7803-7898-9
DOI :
10.1109/IJCNN.2003.1223719