DocumentCode :
2207570
Title :
Feature Selection for Unsupervised Learning Using Random Cluster Ensembles
Author :
Elghazel, Haytham ; Aussem, Alex
Author_Institution :
Univ. de Lyon, Lyon, France
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
168
Lastpage :
175
Abstract :
In this paper, we propose another extension of the Random Forests paradigm to unlabeled data, leading to localized unsupervised feature selection (FS). We show that the way internal estimates are used to measure variable importance in Random Forests are also applicable to FS in unsupervised learning. We first illustrate the clustering performance of the proposed method on various data sets based on widely used external criteria of clustering quality. We then assess the accuracy and the scalability of the FS procedure on UCI and real labeled data sets and compare its effectiveness against other FS methods.
Keywords :
feature extraction; pattern clustering; unsupervised learning; FS method; FS procedure; UCI; clustering performance; clustering quality; feature selection; random cluster ensemble; random forest paradigm; real labeled data set; unlabeled data; unsupervised learning; variable importance; Random Forest; Unsupervised learning; feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2010.137
Filename :
5693970
Link To Document :
بازگشت