• DocumentCode
    508294
  • Title

    A Novel Hypothesis-Margin Based Method Incorporating Minimal-Redundancy Criterion for Feature Selection

  • Author

    Yang, Ming ; Yang, Ping

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Nanjing Normal Univ., Nanjing, China
  • Volume
    2
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    189
  • Lastpage
    194
  • Abstract
    Simba is a recently proposed algorithm based on hypothesis-margin for feature selection, it uses maximizing hypothesis-margin as a criterion for evaluating the effectiveness of a feature subset, in this way an effective feature subset can be efficiently obtained by employing the stochastic gradient ascent strategy. However, this algorithm still can not eliminate completely those redundant features. To overcome this drawback, in this paper, we propose a novel hypothesis-margin based method for feature selection incorporating minimal-redundant criterion (Rsimba). In Rsimba, after getting the weights of features by employing hypothesis-margin strategy, the mutual information criterion induced by clustering is introduced for removing those redundant features, in this way an effectively relevant feature subset can be efficiently obtained. Experiments show that the classification performance induced by Rsimba is better than that induced by Simba on all benchmark data sets.
  • Keywords
    algorithm theory; learning (artificial intelligence); pattern clustering; Rsimba criterion; feature selection; feature subset; hypothesis margin algorithm; minimal redundancy criterion; mutual information criterion; pattern clustering; stochastic gradient ascent strategy; Clustering algorithms; Computer science; Data mining; Feature extraction; Filters; Machine learning; Mathematics; Mutual information; Prediction algorithms; Stochastic processes; Feature selection; Hypothesis-margin; minimal-redundant;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation, 2009. ICNC '09. Fifth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3736-8
  • Type

    conf

  • DOI
    10.1109/ICNC.2009.105
  • Filename
    5366499