• DocumentCode
    1245655
  • Title

    Toward integrating feature selection algorithms for classification and clustering

  • Author

    Liu, Huan ; Yu, Lei

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA
  • Volume
    17
  • Issue
    4
  • fYear
    2005
  • fDate
    4/1/2005 12:00:00 AM
  • Firstpage
    491
  • Lastpage
    502
  • Abstract
    This paper introduces concepts and algorithms of feature selection, surveys existing feature selection algorithms for classification and clustering, groups and compares different algorithms with a categorizing framework based on search strategies, evaluation criteria, and data mining tasks, reveals unattempted combinations, and provides guidelines in selecting feature selection algorithms. With the categorizing framework, we continue our efforts toward-building an integrated system for intelligent feature selection. A unifying platform is proposed as an intermediate step. An illustrative example is presented to show how existing feature selection algorithms can be integrated into a meta algorithm that can take advantage of individual algorithms. An added advantage of doing so is to help a user employ a suitable algorithm without knowing details of each algorithm. Some real-world applications are included to demonstrate the use of feature selection in data mining. We conclude this work by identifying trends and challenges of feature selection research and development.
  • Keywords
    data mining; feature extraction; meta data; pattern classification; pattern clustering; very large databases; categorizing framework; data classification; data clustering; data mining; feature selection; real-world applications; search strategies; unifying platform; Classification algorithms; Clustering algorithms; Data mining; Data preprocessing; Filters; Input variables; Machine learning; Research and development; Spatial databases; Statistics; Index Terms- Feature selection; categorizing framework; classification; clustering; real-world applications.; unifying platform;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2005.66
  • Filename
    1401889