DocumentCode :
3186528
Title :
Semi-supervised feature selection based on label propagation and subset selection
Author :
Liu, Yun ; Nie, Feiping ; Wu, Jigang ; Chen, Lihui
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2010
fDate :
3-5 Dec. 2010
Firstpage :
293
Lastpage :
296
Abstract :
In practice, the data to be handled are often high dimensional, and labeled data are often very limited while a large numbers of unlabeled data can be easily collected. Feature selection is an important method to deal with high dimensional data. In this paper, we propose a novel semi-supervised feature selection algorithm to select relevant features using both labeled and unlabeled data. Specifically, the algorithm explores the distribution of the labeled and unlabeled data with a special label propagation method to obtain the soft labels of unlabeled data, then an efficient algorithm to optimize the trace ratio criterion is used to directly select the optimal feature subset. Experimental results verify the effectiveness of the proposed algorithm, and show significant improvement over traditional supervised feature selection algorithms.
Keywords :
data handling; feature extraction; pattern classification; label propagation method; optimal feature subset; semisupervised feature selection; soft label; subset selection; unlabeled data; Accuracy; Computers; Educational institutions; Harmonic analysis; Probabilistic logic; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Application (ICCIA), 2010 International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4244-8597-0
Type :
conf
DOI :
10.1109/ICCIA.2010.6141595
Filename :
6141595
Link To Document :
بازگشت