DocumentCode :
3492927
Title :
On improving trust-region variable projection algorithms for separable nonlinear least squares learning
Author :
Mizutani, Eiji ; Demmel, James
Author_Institution :
Dept. of Ind. Manage., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
397
Lastpage :
404
Abstract :
In numerical linear algebra, the variable projection (VP) algorithm has been a standard approach to separable “mixed” linear and nonlinear least squares problems since early 1970s. Such a separable case often arises in diverse contexts of machine learning (e.g., with generalized linear discriminant functions); yet VP is not fully investigated in the literature. We thus describe in detail its implementation issues, highlighting an economical trust-region implementation of VP in the framework of a so-called block-arrow least squares (BA) algorithm for a general multiple-response nonlinear model. We then present numerical results using an exponential-mixture benchmark, seven-bit parity, and color reproduction problems; in some situations, VP enjoys quick convergence and attains high classification rates, while in some others VP works poorly. This observation motivates us to investigate original VP´s strengths and weaknesses compared with other (full-functional) approaches. To overcome the limitation of VP, we suggests how VP can be modified to be a Hessian matrix-based approach that exploits negative curvature when it arises. For this purpose, our economical BA algorithm is very useful in implementing such a modified VP especially when a given model is expressed in a multi-layer (neural) network for efficient Hessian evaluation by the so-called second-order stagewise backpropagation.
Keywords :
Hessian matrices; backpropagation; least squares approximations; linear algebra; multilayer perceptrons; BA algorithm; Hessian matrix; block-arrow least square algorithm; color reproduction problem; economical trust-region implementation; exponential-mixture benchmark; linear least square problem; machine learning context; multilayer network; multiple-response nonlinear model; nonlinear least square learning; numerical linear algebra; second-order stagewise backpropagation; trust-region variable projection algorithm; Backpropagation; Barium; Jacobian matrices; Machine learning; Machine learning algorithms; Symmetric matrices; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033249
Filename :
6033249
Link To Document :
بازگشت