Title of article :
FFS: An F-DBSCAN Clustering- Based Feature Selection for Classification Data
Author/Authors :
Eshaghi, Nasim Department of Computer Engineering - Rouzbahan University, Sari, Iran , Aghagolzadeh, Ali Faculty of Electrical and Computer Engineering - Babol Noshirvani University of Technology, Babol, Iran
Pages :
12
From page :
43
To page :
54
Abstract :
Feature selection is an important step in most classification problems to select an optimal subset of features to increase the learning accuracy and reduce the computational time. In this paper we proposed a new feature clustering based method to perform feature selection (FFS) in classification problems. The FFS algorithm works in two steps. In the first step, features are divided into clusters by using F-DBSCAN method. A novel F-DBSCAN clustering method used mutual information for measuring dependencies between features. In the second step, the most representative feature is selected from each cluster by a new criterion function. This allows us to consider the possible dependency on the target class and the redundancy between the selected features in each cluster. The experimental results on different datasets show that the proposed algorithm is more effective for feature selection in classification problems. Compared with the other methods, the average classification accuracy of C4.5, KNN and Naïve Bayes are improved using FFS by 8.05, 8.36 and 4.63 percent, respectively. Also, the results demonstrate that the FFS algorithm produces small subsets of features with very high classification rate.
Keywords :
Feature Selection , Mutual Information , Feature Clustering
Journal title :
Journal of Advances in Computer Research
Serial Year :
2017
Record number :
2497486
Link To Document :
بازگشت