DocumentCode
2235770
Title
An Idea of setting weighting functions for feature selection
Author
Weijie Li ; Haiqiang Chen ; Wei Cao ; Xin Zhou
Author_Institution
China Inf. Technol. Security Evaluation Center, Beijing, China
fYear
2012
fDate
Oct. 30 2012-Nov. 1 2012
Firstpage
690
Lastpage
695
Abstract
In this paper, we propose a novel feature selection method, which improves effectively traditional mutual information based feature selection. The method takes as the first step traditional mutual information based feature selection. Then the method multiplies each feature by a weighting coefficient that is directly related to the mutual information value between the feature and class labels. Finally the multiplication results of the features with large mutual values are used as final features for classification. The result of nearest neighbor (NN) classification on spam emails filter and prediction of molecular bioactivity shows that the proposed method is able to improve the performance of NN classification. In additional, using fewer features NN classification is capable of achieving the same accuracy as NN classification using all of original features.
Keywords
learning (artificial intelligence); pattern classification; NN classification; class labels; feature classification; feature labels; feature selection method; molecular bioactivity prediction; mutual information based feature selection; nearest neighbor classification; spam emails filter; weighting coefficient; weighting functions; Accuracy; Compounds; Correlation; Electronic mail; Mutual information; Pattern classification; Training; Feature selection; Mutual information; Nearest neighbor classification; Pattern recognition; Weighting coefficient;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-4673-1855-6
Type
conf
DOI
10.1109/CCIS.2012.6664263
Filename
6664263
Link To Document