مرکز منطقه ای اطلاع رساني علوم و فناوري - Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy

DocumentCode :

3105330

Title :

Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy

Author :

Mathur, Arpit ; Chakrabarti, Soumen

Author_Institution :

IIT Bombay, Mumbai

fYear :

2006

fDate :

18-22 Dec. 2006

Firstpage :

404

Lastpage :

413

Abstract :

Log-linear models are widely used for labeling feature vectors and graphical models, typically to estimate robust conditional distributions in presence of a large number of potentially redundant features. Limited-memory quasi-Newton methods like LBFGS or BLMVM are optimization workhorses for such applications, and most of the training time is spent computing the objective and gradient for the optimizer. We propose a simple technique to speed up the training optimization by clustering features dynamically, and interleaving the standard optimizer with another, coarse-grained, faster optimizer that uses far fewer variables. Experiments with logistic regression training for text classification and conditional random field (CRF) training for information extraction show promising speed-ups between 2times and 9times without any systematic or significant degradation in the quality of the estimated models.

Keywords :

Newton method; feature extraction; gradient methods; optimisation; pattern classification; regression analysis; Newton optimization; conditional random field; feature clustering; feature redundancy; feature vectors; graphical models; information extraction; limited-memory quasiNewton method; log-linear model; logistic regression training; optimizer gradient; text classification; Acceleration; Data mining; Degradation; Graphical models; Interleaved codes; Labeling; Logistics; Optimization methods; Robustness; Text categorization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Mining, 2006. ICDM '06. Sixth International Conference on

Conference_Location :

Hong Kong

ISSN :

1550-4786

Print_ISBN :

0-7695-2701-7

Type :

conf

DOI :

10.1109/ICDM.2006.11

Filename :

4053067

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3105330