DocumentCode
2671159
Title
A hybrid feature selection method for data sets of thousands of variables
Author
Liu, Jihong ; Wang, Guoxiong
Author_Institution
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Volume
2
fYear
2010
fDate
27-29 March 2010
Firstpage
288
Lastpage
291
Abstract
Feature selection has become a focus of research in applications whose datasets contain thousands of variables. In this study we present a hybrid feature selection (HFS) method that adopts both the filter and wrapper models of feature subset selection. In the first stage, we use the filter model to rank the features by the mutual information (MI) between each feature and each class, and then choose the k features most relevant to the classes. In the second stage, we apply a wrapper-model-based feature selection algorithm, which uses the Shapley value to evaluate the contribution of each feature to the classification task within a feature subset. Experimental results show that the HFS method obtains better classification performance than feature selection based on the Shapley value alone or on MI alone.
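The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the classifier, the value function, or how Shapley values are computed, so the sketch assumes discrete features, a nearest-centroid classifier scored by training accuracy, and Monte Carlo permutation sampling to approximate Shapley values. All function names, the synthetic data, and the parameters k, m and n_permutations are hypothetical.

```python
import numpy as np

def mutual_information(x, y):
    """MI (in nats) between two discrete 1-D integer arrays."""
    mi = 0.0
    for xv in np.unique(x):
        px = np.mean(x == xv)
        for yv in np.unique(y):
            py = np.mean(y == yv)
            pxy = np.mean((x == xv) & (y == yv))
            if pxy > 0:
                mi += pxy * np.log(pxy / (px * py))
    return mi

def filter_top_k(X, y, k):
    """Stage 1 (filter): rank features by MI with the class, keep the top k."""
    scores = np.array([mutual_information(X[:, j], y) for j in range(X.shape[1])])
    return np.argsort(scores)[::-1][:k], scores

def subset_accuracy(X, y, features):
    """Assumed value function: training accuracy of a nearest-centroid classifier."""
    if len(features) == 0:
        return np.max(np.bincount(y)) / len(y)  # majority-class baseline
    Z = X[:, sorted(features)].astype(float)    # sorted: value depends on the set only
    classes = np.unique(y)
    centroids = np.stack([Z[y == c].mean(axis=0) for c in classes])
    dists = ((Z[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return np.mean(classes[dists.argmin(axis=1)] == y)

def shapley_values(X, y, candidates, n_permutations=50, seed=0):
    """Stage 2 (wrapper): Monte Carlo estimate of each candidate's Shapley value."""
    rng = np.random.default_rng(seed)
    phi = np.zeros(len(candidates))
    for _ in range(n_permutations):
        chosen = []
        prev = subset_accuracy(X, y, chosen)
        for idx in rng.permutation(len(candidates)):
            chosen.append(candidates[idx])
            cur = subset_accuracy(X, y, chosen)
            phi[idx] += cur - prev              # marginal contribution in this ordering
            prev = cur
    return phi / n_permutations

def hybrid_select(X, y, k, m, **kwargs):
    """Keep the m candidates with the largest estimated Shapley value among the MI top k."""
    top_k, _ = filter_top_k(X, y, k)
    phi = shapley_values(X, y, top_k, **kwargs)
    return top_k[np.argsort(phi)[::-1][:m]]

# Synthetic data (assumption): feature 0 equals the label, feature 1 is a noisy
# copy of it, and the remaining features are random noise.
rng = np.random.default_rng(42)
y = rng.integers(0, 2, size=200)
X = rng.integers(0, 3, size=(200, 10))
X[:, 0] = y
X[:, 1] = np.where(rng.random(200) < 0.1, 1 - y, y)

top_k, mi_scores = filter_top_k(X, y, k=4)
selected = hybrid_select(X, y, k=4, m=2, n_permutations=20)
```

Running the MI filter first keeps the expensive Shapley estimation, which retrains the classifier for every sampled subset, confined to k candidates rather than thousands of variables.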
Keywords
classification; game theory; Shapley value; classification performance; classification task; data sets; feature selection algorithm; feature subset selection; filter model; hybrid feature selection method; mutual information; wrapper model; Classification algorithms; Data engineering; Educational institutions; Information filtering; Information filters; Information science; Internet; Mutual information; Space exploration; Text processing; Shapley value; feature selection; mutual information;
fLanguage
English
Publisher
IEEE
Conference_Titel
Advanced Computer Control (ICACC), 2010 2nd International Conference on
Conference_Location
Shenyang
Print_ISBN
978-1-4244-5845-5
Type
conf
DOI
10.1109/ICACC.2010.5486671
Filename
5486671