DocumentCode
3579124
Title
An enhanced feature selection method comprising rough set and clustering techniques
Author
Murugan, A. ; Sridevi, T.
Author_Institution
Department of Computer Science, Dr. Ambedkar Govt. Arts College, Chennai, Tamil Nadu, India
fYear
2014
Firstpage
1
Lastpage
4
Abstract
Feature selection or variable reduction is a fundamental problem in data mining, refers to the process of identifying the few most important features for application of a learning algorithm. The best subset contains the minimum number of dimensions retaining a suitably high accuracy on classifier in representing the original features. The objective of the proposed approach is to reduce the number of input features thus to identify the key features and eliminating irrelevant features with no predictive information using clustering technique, K-nearest neighbors (KNN) and rough set. This paper deals with two partition based clustering algorithm in data mining namely K-Means and Fuzzy C Means (FCM). These two algorithms are implemented for original data set without considering the class labels and further rough set theory implemented on the partitioned data set to generate feature subset after removing the outlier by using KNN. Wisconsin Breast Cancer datasets derived from UCI machine learning database are used for the purpose of testing the proposed hybrid method. The results show that the hybrid method is able to produce more accurate diagnosis and prognosis results than the full input model with respect to the classification accuracy.
Keywords
Accuracy; Algorithm design and analysis; Breast cancer; Classification algorithms; Clustering algorithms; Data mining; Partitioning algorithms; classification; fcm; feature selection; k-means; machine learning; outlier detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
Print_ISBN
978-1-4799-3974-9
Type
conf
DOI
10.1109/ICCIC.2014.7238376
Filename
7238376
Link To Document