• DocumentCode
    2671159
  • Title

    A hybrid feature selection method for data sets of thousands of variables

  • Author

    Liu, Jihong ; Wang, Guoxiong

  • Author_Institution
    Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
  • Volume
    2
  • fYear
    2010
  • fDate
    27-29 March 2010
  • Firstpage
    288
  • Lastpage
    291
  • Abstract
    Feature selection has become the focus of research areas of applications with datasets of thousands of variables. In this study we present a hybrid feature selection (HFS) method that adopts both filter and wrapper models of feature subset selection. In the first stage of the feature selection, we use the filter model to rank the features by the mutual information (MI) between each feature and each class, and then choose k highest relevant features to the classes. In the second stage, we complete a wrapper model based feature selection algorithm, which uses Shepley value to evaluate the contribution of features to the classification task in a feature subset. Experimental results show obviously that the HFS method obtains better classification performance than solo Shepley value based or solo MI based feature selection method.
  • Keywords
    classification; game theory; Shepley value; classification performance; classification task; data sets; feature selection algorithm; feature subset selection; filter model; hybrid feature selection method; mutual information; wrapper model; Classification algorithms; Data engineering; Educational institutions; Information filtering; Information filters; Information science; Internet; Mutual information; Space exploration; Text processing; Shepley value; feature selection; mutual information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Control (ICACC), 2010 2nd International Conference on
  • Conference_Location
    Shenyang
  • Print_ISBN
    978-1-4244-5845-5
  • Type

    conf

  • DOI
    10.1109/ICACC.2010.5486671
  • Filename
    5486671