Title :
Model Selection via Bilevel Optimization
Author :
Bennett, Kristin P. ; Hu, Jing ; Ji, Xiaoyun ; Kunapuli, Gautam ; Pang, Jong-Shi
Author_Institution :
Rensselaer Polytech. Inst., Troy
Abstract :
A key step in many statistical learning methods used in machine learning involves solving a convex optimization problem containing one or more hyper-parameters that must be selected by the users. While cross validation is a commonly employed and widely accepted method for selecting these parameters, its implementation by a grid-search procedure in the parameter space effectively limits the desirable number of hyper-parameters in a model, due to the combinatorial explosion of grid points in high dimensions. This paper proposes a novel bilevel optimization approach to cross validation that provides a systematic search of the hyper-parameters. The bilevel approach enables the use of the state-of-the-art optimization methods and their well-supported softwares. After introducing the bilevel programming approach, we discuss computational methods for solving a bilevel cross-validation program, and present numerical results to substantiate the viability of this novel approach as a promising computational tool for model selection in machine learning.
Keywords :
convex programming; learning (artificial intelligence); statistical analysis; bilevel optimization; bilevel optimization approach; bilevel programming approach; combinatorial explosion; computational methods; convex optimization problem; cross validation; grid-search procedure; hyper-parameters; machine learning; model selection; statistical learning methods; Computational modeling; Data analysis; Explosions; Filters; Kernel; Machine learning; Optimization methods; Statistical learning; Support vector machine classification; Support vector machines;
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9490-9
DOI :
10.1109/IJCNN.2006.246935