Title of article :
Towards improving cluster-based feature selection with a simplified silhouette filter
Author/Authors :
Thiago F. Cov?es، نويسنده , , Eduardo R. Hruschka، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Pages :
17
From page :
3766
To page :
3782
Abstract :
This paper proposes a filter-based algorithm for feature selection. The filter is based on the partitioning of the set of features into clusters. The number of clusters, and consequently the cardinality of the subset of selected features, is automatically estimated from data. The computational complexity of the proposed algorithm is also investigated. A variant of this filter that considers feature-class correlations is also proposed for classification problems. Empirical results involving ten datasets illustrate the performance of the developed algorithm, which in general has obtained competitive results in terms of classification accuracy when compared to state of the art algorithms that find clusters of features. We show that, if computational efficiency is an important issue, then the proposed filter may be preferred over their counterparts, thus becoming eligible to join a pool of feature selection algorithms to be used in practice. As an additional contribution of this work, a theoretical framework is used to formally analyze some properties of feature selection methods that rely on finding clusters of features.
Keywords :
feature selection , Clustering , filters , Classification
Journal title :
Information Sciences
Serial Year :
2011
Journal title :
Information Sciences
Record number :
1214586
Link To Document :
بازگشت