Title :
A Variance Minimization Criterion to Feature Selection Using Laplacian Regularization
Author :
He, Xiaofei ; Ji, Ming ; Zhang, Chiyuan ; Bao, Hujun
Author_Institution :
State Key Lab. of CAD&CG, Zhejiang Univ., Hangzhou, China
Abstract :
In many information processing tasks, one is often confronted with very high-dimensional data. Feature selection techniques are designed to find the meaningful feature subset of the original features which can facilitate clustering, classification, and retrieval. In this paper, we consider the feature selection problem in unsupervised learning scenarios, which is particularly difficult due to the absence of class labels that would guide the search for relevant information. Based on Laplacian regularized least squares, which finds a smooth function on the data manifold and minimizes the empirical loss, we propose two novel feature selection algorithms which aim to minimize the expected prediction error of the regularized regression model. Specifically, we select those features such that the size of the parameter covariance matrix of the regularized regression model is minimized. Motivated from experimental design, we use trace and determinant operators to measure the size of the covariance matrix. Efficient computational schemes are also introduced to solve the corresponding optimization problems. Extensive experimental results over various real-life data sets have demonstrated the superiority of the proposed algorithms.
Keywords :
covariance matrices; information retrieval; least squares approximations; minimisation; pattern classification; pattern clustering; regression analysis; unsupervised learning; Laplacian regularization; Laplacian regularized least squares; data manifold; feature classification; feature clustering; feature retrieval; feature selection; information processing task; parameter covariance matrix; regularized regression model; unsupervised learning; variance minimization criterion; Algorithm design and analysis; Clustering algorithms; Covariance matrix; Feature extraction; Laplace equations; Manifolds; Optimization; Feature selection; clustering.; dimensionality reduction; manifold; regression; regularization;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2011.44