• DocumentCode
    839968
  • Title

    Feature Selection Using f-Information Measures in Fuzzy Approximation Spaces

  • Author

    Maji, Pradipta ; Pal, Sankar K.

  • Author_Institution
    Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
  • Volume
    22
  • Issue
    6
  • fYear
    2010
  • fDate
    6/1/2010 12:00:00 AM
  • Firstpage
    854
  • Lastpage
    867
  • Abstract
    The selection of nonredundant and relevant features of real-valued data sets is a highly challenging problem. A novel feature selection method is presented here based on fuzzy-rough sets by maximizing the relevance and minimizing the redundancy of the selected features. By introducing the fuzzy equivalence partition matrix, a novel representation of Shannon´s entropy for fuzzy approximation spaces is proposed to measure the relevance and redundancy of features suitable for real-valued data sets. The fuzzy equivalence partition matrix also offers an efficient way to calculate many more information measures, termed as f-information measures. Several f-information measures are shown to be effective for selecting nonredundant and relevant features of real-valued data sets. This paper compares the performance of different f-information measures for feature selection in fuzzy approximation spaces. Some quantitative indexes are introduced based on fuzzy-rough sets for evaluating the performance of proposed method. The effectiveness of the proposed method, along with a comparison with other methods, is demonstrated on a set of real-life data sets.
  • Keywords
    approximation theory; data mining; entropy; fuzzy set theory; matrix algebra; rough set theory; Shannon entropy; data mining; f-information measures; feature selection method; fuzzy approximation spaces; fuzzy equivalence partition matrix; fuzzy-rough sets; Pattern recognition; data mining; f-information measures.; feature selection; fuzzy-rough sets;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2009.124
  • Filename
    4912208