DocumentCode :
3169396
Title :
Feature selection for clustering problems: a hybrid algorithm that iterates between k-means and a Bayesian filter
Author :
Hruschka, Eduardo R. ; Hruschka, Estevam R., Jr. ; Covões, Thiago F. ; Ebecken, Nelson F F
Author_Institution :
Catholic Univ. of Santos, Brazil
fYear :
2005
fDate :
6-9 Nov. 2005
Abstract :
There are two fundamentally different approaches for feature selection: wrapper and filter. It is also possible to combine them, obtaining hybrid approaches. This paper describes a hybrid method for selecting relevant features in clustering problems. The proposed approach is based on the combination of the widely known k-means algorithm and a Bayesian filter, which is based on the Markov Blanket concept. Since the number of clusters and the subset of relevant features are usually inter-related, we propose a method that iterates between clustering (assuming that the number of clusters is not known a priori) and filtering. Experiments in a number of datasets show that the proposed approach allows selecting features that provide good partitions.
Keywords :
Bayes methods; feature extraction; filtering theory; pattern clustering; Bayesian filter; Markov blanket concept; feature selection; filter approach; hybrid algorithm; hybrid method; k-means algorithm; wrapper approach; Bayesian methods; Clustering algorithms; Clustering methods; Data analysis; Data visualization; Filtering; Filters; Gene expression; Supervised learning; Text mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Intelligent Systems, 2005. HIS '05. Fifth International Conference on
Print_ISBN :
0-7695-2457-5
Type :
conf
DOI :
10.1109/ICHIS.2005.42
Filename :
1587781
Link To Document :
بازگشت