• DocumentCode
    3579124
  • Title

    An enhanced feature selection method comprising rough set and clustering techniques

  • Author

    Murugan, A. ; Sridevi, T.

  • Author_Institution
    Department of Computer Science, Dr. Ambedkar Govt. Arts College, Chennai, Tamil Nadu, India
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Feature selection or variable reduction is a fundamental problem in data mining, refers to the process of identifying the few most important features for application of a learning algorithm. The best subset contains the minimum number of dimensions retaining a suitably high accuracy on classifier in representing the original features. The objective of the proposed approach is to reduce the number of input features thus to identify the key features and eliminating irrelevant features with no predictive information using clustering technique, K-nearest neighbors (KNN) and rough set. This paper deals with two partition based clustering algorithm in data mining namely K-Means and Fuzzy C Means (FCM). These two algorithms are implemented for original data set without considering the class labels and further rough set theory implemented on the partitioned data set to generate feature subset after removing the outlier by using KNN. Wisconsin Breast Cancer datasets derived from UCI machine learning database are used for the purpose of testing the proposed hybrid method. The results show that the hybrid method is able to produce more accurate diagnosis and prognosis results than the full input model with respect to the classification accuracy.
  • Keywords
    Accuracy; Algorithm design and analysis; Breast cancer; Classification algorithms; Clustering algorithms; Data mining; Partitioning algorithms; classification; fcm; feature selection; k-means; machine learning; outlier detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Computing Research (ICCIC), 2014 IEEE International Conference on
  • Print_ISBN
    978-1-4799-3974-9
  • Type

    conf

  • DOI
    10.1109/ICCIC.2014.7238376
  • Filename
    7238376