• DocumentCode
    671675
  • Title

    A hybrid optimization method for acceleration of building linear classification models

  • Author

    Junchao Lv ; Qiang Wang ; Huang, Joshua Zhexue

  • Author_Institution
    Shenzhen Key Lab. of High Performance Data Min., Shenzhen Univ. Town, Shenzhen, China
  • fYear
    2013
  • fDate
    4-9 Aug. 2013
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Linear classification is an important technique in machine learning and data mining, and development of fast optimization methods for training linear classification models is a hot research topic. Stochastic gradient descent (SGD) can achieve relatively good results quickly, but unstable to converge. Limited-memory BFGS (L-BFGS) method converges, but takes a long time to train the model, as it needs to compute the gradient from the entire data set to make an update. In this paper, we investigate a hybrid method that integrates SGD and L-BFGS into a new optimization process SGD-LBFGS to take advantages of both optimization methods. In SGD-LBFGS, SGD is used to run initial iterations to obtain a suboptimal result, and then L-BFGS takes over to continue the optimization process until the process converges and a better model is built. We present a theoretical result to prove that SGD-LBFGS converges faster than SGD and L-BFGS. Experiment analysis on 6 real world data sets have shown that SGD-LBFGS converged 77% faster than L-BFGS on average and demonstrated more stable results than SGD.
  • Keywords
    data mining; gradient methods; iterative methods; learning (artificial intelligence); optimisation; pattern classification; stochastic processes; building linear classification model acceleration; data mining; fast optimization methods; hybrid optimization method; limited-memory BFGS method; machine learning; optimization process SGD-LBFGS; stochastic gradient descent; Convergence; Learning systems; Linear programming; Logistics; Optimization methods; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2013 International Joint Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-6128-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.2013.6707017
  • Filename
    6707017