• DocumentCode
    3492927
  • Title

    On improving trust-region variable projection algorithms for separable nonlinear least squares learning

  • Author

    Mizutani, Eiji ; Demmel, James

  • Author_Institution
    Dept. of Ind. Manage., Nat. Taiwan Univ. of Sci. & Technol., Taipei, Taiwan
  • fYear
    2011
  • fDate
    July 31 2011-Aug. 5 2011
  • Firstpage
    397
  • Lastpage
    404
  • Abstract
    In numerical linear algebra, the variable projection (VP) algorithm has been a standard approach to separable “mixed” linear and nonlinear least squares problems since early 1970s. Such a separable case often arises in diverse contexts of machine learning (e.g., with generalized linear discriminant functions); yet VP is not fully investigated in the literature. We thus describe in detail its implementation issues, highlighting an economical trust-region implementation of VP in the framework of a so-called block-arrow least squares (BA) algorithm for a general multiple-response nonlinear model. We then present numerical results using an exponential-mixture benchmark, seven-bit parity, and color reproduction problems; in some situations, VP enjoys quick convergence and attains high classification rates, while in some others VP works poorly. This observation motivates us to investigate original VP´s strengths and weaknesses compared with other (full-functional) approaches. To overcome the limitation of VP, we suggests how VP can be modified to be a Hessian matrix-based approach that exploits negative curvature when it arises. For this purpose, our economical BA algorithm is very useful in implementing such a modified VP especially when a given model is expressed in a multi-layer (neural) network for efficient Hessian evaluation by the so-called second-order stagewise backpropagation.
  • Keywords
    Hessian matrices; backpropagation; least squares approximations; linear algebra; multilayer perceptrons; BA algorithm; Hessian matrix; block-arrow least square algorithm; color reproduction problem; economical trust-region implementation; exponential-mixture benchmark; linear least square problem; machine learning context; multilayer network; multiple-response nonlinear model; nonlinear least square learning; numerical linear algebra; second-order stagewise backpropagation; trust-region variable projection algorithm; Backpropagation; Barium; Jacobian matrices; Machine learning; Machine learning algorithms; Symmetric matrices; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2011 International Joint Conference on
  • Conference_Location
    San Jose, CA
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4244-9635-8
  • Type

    conf

  • DOI
    10.1109/IJCNN.2011.6033249
  • Filename
    6033249