Title :
Feature selection technology based on sample unbiased evaluation
Author :
Huaiguang Liu; Jianyi Kong; Xingdong Wang; Yuanjiong Liu
Author_Institution :
School of Mechanical and Automation Engineering, Wuhan University of Science and Technology, China
Abstract :
This paper addresses feature selection for unbalanced data sets, in which class sizes differ. ReliefF has proved successful at filtering out irrelevant features, but it is considered a biased approach on unbalanced data sets. The paper describes a fair evaluation method that overcomes this defect. In addition, because ReliefF is sensitive to noisy or irrelevant features when it selects the k nearest samples, a feature distance is proposed as a substitute for the Euclidean distance. Experiments on manually constructed data and UCI data sets indicate that the improved method works better than ReliefF and InfoGain when used as a preprocessing step for naive Bayes and C4.5.
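For orientation, the baseline that the abstract compares against can be sketched as follows. This is a minimal illustration of standard ReliefF for numeric features, not the authors' improved method: the function name `relieff` and its parameters are hypothetical, features are assumed to be range-normalized, and neighbors are found with the Euclidean distance that the abstract says the proposed feature distance would replace.

```python
import numpy as np

def relieff(X, y, k=5, n_iter=None, seed=0):
    """Baseline ReliefF feature weights (illustrative sketch, numeric features only)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    n, d = X.shape

    # Range-normalize so each per-feature difference lies in [0, 1].
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0  # constant features: avoid division by zero
    Xn = (X - X.min(axis=0)) / span

    # Class priors estimated from class frequencies.
    classes, counts = np.unique(y, return_counts=True)
    prior = dict(zip(classes, counts / n))

    rng = np.random.default_rng(seed)
    m = n if n_iter is None else n_iter
    w = np.zeros(d)

    for i in rng.choice(n, size=m, replace=(m > n)):
        diff = np.abs(Xn - Xn[i])                # per-feature difference to every sample
        dist = np.sqrt((diff ** 2).sum(axis=1))  # Euclidean distance (assumption, see lead-in)
        for c in classes:
            # Candidates of class c, excluding the sampled instance itself.
            idx = np.flatnonzero((y == c) & (np.arange(n) != i))
            if idx.size == 0:
                continue
            nearest = idx[np.argsort(dist[idx])[:k]]
            mean_diff = diff[nearest].mean(axis=0)
            if c == y[i]:
                # Nearest hits: differing from same-class neighbors lowers the weight.
                w -= mean_diff / m
            else:
                # Nearest misses, weighted by the prior of the other class.
                w += prior[c] / (1.0 - prior[y[i]]) * mean_diff / m
    return w
```

Calling `relieff(X, y, k=5)` returns one weight per feature, with larger values suggesting more relevant features. Because every feature contributes to `dist`, noisy or irrelevant features influence which neighbors are chosen, which is the sensitivity the abstract refers to.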
Keywords :
"Noise measurement","Euclidean distance","Manuals","Bismuth","Algorithm design and analysis","Classification algorithms","Training data"
Conference_Title :
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
DOI :
10.1109/FSKD.2015.7382025