• DocumentCode
    396663
  • Title

    An efficient learning algorithm with second-order convergence for multilayer neural networks

  • Author

    Ninomiya, Hiroshi ; Tomita, Chikahiro ; Asai, Hideki

  • Author_Institution
    Dept. of Inf. Sci., Shonan Inst. of Technol., Fujisawa, Japan
  • Volume
    3
  • fYear
    2003
  • fDate
    20-24 July 2003
  • Firstpage
    2028
  • Abstract
    This paper describes an efficient second-order algorithm for learning of the multilayer neural networks with widely and stable convergent properties. First, the algorithm based on iterative formula of the steepest descent method, which is "implicitly" employed, is introduced. We show the equivalent property between the Gauss-Newton(GN) method and the "implicit" steepest descent (ISD) method. This means that ISD method satisfy the desired targets by simultaneously combining the merits of the GN and SD techniques in order to enhance the very good properties of SD method. Next, we propose very powerful algorithm for learning multilayer feedforward neural networks, called "implicit" steepest descent with momentum (ISDM) method and show the analogy with the trapezoidal formula in the field of numerical analysis. Finally, the proposed algorithms are compared with GN method for training multilayer neural networks through the computer simulations.
  • Keywords
    Newton method; convergence of numerical methods; feedforward neural nets; iterative methods; learning (artificial intelligence); GN method; Gauss-Newton method; ISD method; ISDM method; computer simulation; implicit steepest descent method; implicit steepest descent with momentum method; iterative formula; learning algorithm; multilayer feedforward neural networks; numerical analysis; second-order algorithm; second-order convergence; training multilayer neural networks; trapezoidal formula; Artificial neural networks; Convergence; Gaussian processes; Iterative algorithms; Least squares methods; Multi-layer neural network; Neural networks; Newton method; Recursive estimation; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2003. Proceedings of the International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-7898-9
  • Type

    conf

  • DOI
    10.1109/IJCNN.2003.1223719
  • Filename
    1223719