• DocumentCode
    3169396
  • Title

    Feature selection for clustering problems: a hybrid algorithm that iterates between k-means and a Bayesian filter

  • Author

    Hruschka, Eduardo R. ; Hruschka, Estevam R., Jr. ; Covões, Thiago F. ; Ebecken, Nelson F F

  • Author_Institution
    Catholic Univ. of Santos, Brazil
  • fYear
    2005
  • fDate
    6-9 Nov. 2005
  • Abstract
    There are two fundamentally different approaches for feature selection: wrapper and filter. It is also possible to combine them, obtaining hybrid approaches. This paper describes a hybrid method for selecting relevant features in clustering problems. The proposed approach is based on the combination of the widely known k-means algorithm and a Bayesian filter, which is based on the Markov Blanket concept. Since the number of clusters and the subset of relevant features are usually inter-related, we propose a method that iterates between clustering (assuming that the number of clusters is not known a priori) and filtering. Experiments in a number of datasets show that the proposed approach allows selecting features that provide good partitions.
  • Keywords
    Bayes methods; feature extraction; filtering theory; pattern clustering; Bayesian filter; Markov blanket concept; feature selection; filter approach; hybrid algorithm; hybrid method; k-means algorithm; wrapper approach; Bayesian methods; Clustering algorithms; Clustering methods; Data analysis; Data visualization; Filtering; Filters; Gene expression; Supervised learning; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hybrid Intelligent Systems, 2005. HIS '05. Fifth International Conference on
  • Print_ISBN
    0-7695-2457-5
  • Type

    conf

  • DOI
    10.1109/ICHIS.2005.42
  • Filename
    1587781