• DocumentCode
    2361789
  • Title

    Adaptive regularization

  • Author

    Hansen, L.K. ; Rasmussen, C.E. ; Svarer, C. ; Larsen, J.

  • Author_Institution
    Electron. Inst., Tech. Univ. Denmark, Lyngby, Denmark
  • fYear
    1994
  • fDate
    6-8 Sep 1994
  • Firstpage
    78
  • Lastpage
    87
  • Abstract
    Regularization, e.g., in the form of weight decay, is important for training and optimization of neural network architectures. In this work the authors provide a tool based on asymptotic sampling theory, for iterative estimation of weight decay parameters. The basic idea is to do a gradient descent in the estimated generalization error with respect to the regularization parameters. The scheme is implemented in the authors´ Designer Net framework for network training and pruning, i.e., is based on the diagonal Hessian approximation. The scheme does not require essential computational overhead in addition to what is needed for training and pruning. The viability of the approach is demonstrated in an experiment concerning prediction of the chaotic Mackey-Glass series. The authors find that the optimized weight decays are relatively large for densely connected networks in the initial pruning phase, while they decrease as pruning proceeds
  • Keywords
    Hessian matrices; iterative methods; learning (artificial intelligence); neural net architecture; neural nets; parameter estimation; statistical analysis; Designer Net framework; adaptive regularization; asymptotic sampling theory; chaotic Mackey-Glass series; densely connected networks; diagonal Hessian approximation; gradient descent; iterative estimation; network training; neural network architectures; pruning; weight decay; Biological neural networks; Chaos; Computer errors; Delay lines; Estimation theory; Feedforward systems; Optimization methods; Sampling methods; Statistical analysis; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks for Signal Processing [1994] IV. Proceedings of the 1994 IEEE Workshop
  • Conference_Location
    Ermioni
  • Print_ISBN
    0-7803-2026-3
  • Type

    conf

  • DOI
    10.1109/NNSP.1994.366061
  • Filename
    366061