• DocumentCode
    3105330
  • Title

    Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy

  • Author

    Mathur, Arpit ; Chakrabarti, Soumen

  • Author_Institution
    IIT Bombay, Mumbai
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    404
  • Lastpage
    413
  • Abstract
    Log-linear models are widely used for labeling feature vectors and graphical models, typically to estimate robust conditional distributions in presence of a large number of potentially redundant features. Limited-memory quasi-Newton methods like LBFGS or BLMVM are optimization workhorses for such applications, and most of the training time is spent computing the objective and gradient for the optimizer. We propose a simple technique to speed up the training optimization by clustering features dynamically, and interleaving the standard optimizer with another, coarse-grained, faster optimizer that uses far fewer variables. Experiments with logistic regression training for text classification and conditional random field (CRF) training for information extraction show promising speed-ups between 2times and 9times without any systematic or significant degradation in the quality of the estimated models.
  • Keywords
    Newton method; feature extraction; gradient methods; optimisation; pattern classification; regression analysis; Newton optimization; conditional random field; feature clustering; feature redundancy; feature vectors; graphical models; information extraction; limited-memory quasiNewton method; log-linear model; logistic regression training; optimizer gradient; text classification; Acceleration; Data mining; Degradation; Graphical models; Interleaved codes; Labeling; Logistics; Optimization methods; Robustness; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.11
  • Filename
    4053067