DocumentCode
2153447
Title
An empirical study on dimensionality reduction and improvement of classification accuracy using feature subset selection and ranking
Author
Antony Gnana Singh, D.Asir ; alias Balamurugan, S.Appavu ; Leavline, E.Jebamalar
Author_Institution
Department of CSE M.I.E.T Engineering College, Tiruchirappalli - 7, India
fYear
2012
fDate
13-14 Dec. 2012
Firstpage
102
Lastpage
108
Abstract
Data mining is a part in the process of Knowledge discovery from data (KDD). The performance of data mining algorithms mainly depends on the effectiveness of preprocessing algorithms. Dimensionality reduction plays an important role in preprocessing. By research, many methods have been proposed for dimensionality reduction, beside the feature subset selection and feature-ranking methods show significant achievement in dimensionality reduction by removing irrelevant and redundant features in high-dimensional data. This improves the prediction accuracy of the classifier, reduces the false prediction ratio and reduces the time and space complexity for building the prediction model. This paper presents an empirical study analysis on feature subset evaluators Cfs, Consistency and Filtered, Feature Rankers Chi-squared and Information-gain. The performance of these methods is analyzed with the focus on dimensionality reduction and improvement of classification accuracy using wide range of test datasets and classification algorithms namely probability-based Naive Bayes, tree-based C4.5(J48) and instance-based IBl.
Keywords
Classification Accuracy; Classifier; Data mining; Data prepocessing; Dimensionality Reduction; Feature subset selection; Machine learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Emerging Trends in Science, Engineering and Technology (INCOSET), 2012 International Conference on
Conference_Location
Tiruchirappalli, Tamilnadu, India
Print_ISBN
978-1-4673-5141-6
Type
conf
DOI
10.1109/INCOSET.2012.6513889
Filename
6513889
Link To Document