Title : 
On sparsity-exploiting memory-efficient trust-region regularized nonlinear least squares algorithms for neural-network learning
         
        
            Author : 
Mizutani, Eiji ; Demmel, James W.
         
        
            Author_Institution : 
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
         
        
        
        
        
        
            Abstract : 
This paper highlights nonlinear least squares algorithms with trust-region regularization for multiple-output neural-network (NN) models, describing how special structures of the "block-angular" residual Jacobian matrix and the "block-arrow" Gauss-Newton Hessian (or Fisher information matrix) can be exploited to render a large class of NN-learning algorithms "efficient" in both memory and operation counts. In simulation, we demonstrate both direct and iterative trust-region algorithms with two distinct nonlinear models: "multilayer perceptrons (MLP)" and "complementary mixtures of NN-experts" (or neuro-fuzzy modular networks) using a relatively large real-world nonlinear regression application.
         
        
            Keywords : 
Jacobian matrices; learning (artificial intelligence); least squares approximations; multilayer perceptrons; neural nets; nonlinear systems; Fisher information matrix; MLP; block-angular residual Jacobian matrix; block-arrow Gauss-Newton Hessian matrix; complementary mixtures; iterative trust-region algorithms; multilayer perceptrons; multiple-output neural-network; neural-network learning; neuro fuzzy modular networks; real-world nonlinear regression application; sparsity-exploiting memory-efficient trust region; trust region regularized nonlinear least squares; Character generation; Computer science; Iterative algorithms; Iterative methods; Jacobian matrices; Least squares approximation; Least squares methods; Neural networks; Newton method; Recursive estimation;
         
        
        
        
            Conference_Titel : 
Neural Networks, 2003. Proceedings of the International Joint Conference on
         
        
        
            Print_ISBN : 
0-7803-7898-9
         
        
        
            DOI : 
10.1109/IJCNN.2003.1223351