Title of article :
Regularization through variable selection and conditional MLE with application to classification in high dimensions
Author/Authors :
Eitan Greenshtein، نويسنده , , Eitan and Park، نويسنده , , Junyong and Lebanon، نويسنده , , Guy، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2009
Abstract :
It is often the case that high-dimensional data consist of only a few informative components. Standard statistical modeling and estimation in such a situation is prone to inaccuracies due to overfitting, unless regularization methods are practiced. In the context of classification, we propose a class of regularization methods through shrinkage estimators. The shrinkage is based on variable selection coupled with conditional maximum likelihood. Using Steinʹs unbiased estimator of the risk, we derive an estimator for the optimal shrinkage method within a certain class. A comparison of the optimal shrinkage methods in a classification context, with the optimal shrinkage method when estimating a mean vector under a squared loss, is given. The latter problem is extensively studied, but it seems that the results of those studies are not completely relevant for classification. We demonstrate and examine our method on simulated data and compare it to feature annealed independence rule and Fisherʹs rule.
Keywords :
Classification , High dimensions , Steinיs unbiased estimator , Conditional MLE
Journal title :
Journal of Statistical Planning and Inference
Journal title :
Journal of Statistical Planning and Inference