DocumentCode
2746292
Title
Second-order backpropagation algorithms for a stagewise-partitioned separable Hessian matrix
Author
Mizutani, Eiji ; Dreyfus, Stuart E. ; Demmel, James W.
Author_Institution
Dept. of Comput. Sci., Tsing Hua Univ., Hsinchu, Taiwan
Volume
2
fYear
2005
fDate
31 July-4 Aug. 2005
Firstpage
1027
Abstract
Recent advances in computer technology allows the implementation of some important methods that were assigned lower priority in the past due to their computational burdens. Second-order backpropagation (BP) is such a method that computes the exact Hessian matrix of a given objective function. We describe two algorithms for feed-forward neural-network (NN) learning with emphasis on how to organize Hessian elements into a so-called stagewise-partitioned block-arrow matrix form: (1) stagewise BP, an extension of the discrete-time optimal-control stagewise Newton of Dreyfus 1966; and (2) nodewise BP, based on direct implementation of the chain rule for differentiation attributable to Bishop 1992. The former, a more systematic and cost-efficient implementation in both memory and operation, progresses in the same layer-by-layer (i.e., stagewise) fashion as the widely-employed first-order BP computes the gradient vector. We also show intriguing separable structures of each block in the partitioned Hessian, disclosing the rank of blocks.
Keywords
Hessian matrices; backpropagation; feedforward neural nets; discrete-time optimal-control; feedforward neural network learning; gradient vector; second-order backpropagation algorithms; stagewise-partitioned block-arrow matrix form; stagewise-partitioned separable Hessian matrix; Backpropagation algorithms; Computer science; Feedforward systems; Industrial engineering; Mathematics; Multi-layer neural network; Neural networks; Operations research; Optimal control; Optimization methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN
0-7803-9048-2
Type
conf
DOI
10.1109/IJCNN.2005.1555994
Filename
1555994
Link To Document