Title :
An Idea of setting weighting functions for feature selection
Author :
Weijie Li ; Haiqiang Chen ; Wei Cao ; Xin Zhou
Author_Institution :
China Inf. Technol. Security Evaluation Center, Beijing, China
fDate :
Oct. 30 2012-Nov. 1 2012
Abstract :
In this paper, we propose a novel feature selection method, which improves effectively traditional mutual information based feature selection. The method takes as the first step traditional mutual information based feature selection. Then the method multiplies each feature by a weighting coefficient that is directly related to the mutual information value between the feature and class labels. Finally the multiplication results of the features with large mutual values are used as final features for classification. The result of nearest neighbor (NN) classification on spam emails filter and prediction of molecular bioactivity shows that the proposed method is able to improve the performance of NN classification. In additional, using fewer features NN classification is capable of achieving the same accuracy as NN classification using all of original features.
Keywords :
learning (artificial intelligence); pattern classification; NN classification; class labels; feature classification; feature labels; feature selection method; molecular bioactivity prediction; mutual information based feature selection; nearest neighbor classification; spam emails filter; weighting coefficient; weighting functions; Accuracy; Compounds; Correlation; Electronic mail; Mutual information; Pattern classification; Training; Feature selection; Mutual information; Nearest neighbor classification; Pattern recognition; Weighting coefficient;
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
DOI :
10.1109/CCIS.2012.6664263