Title :
Data tuner for effective data pre-processing
Author :
Balamurugan, S.A.A. ; Christopher, A. B Arockia
Author_Institution :
Dept. of Inf. Technol., Thiagarajar Coll. of Eng., Madurai, India
Abstract :
In real world datasets, lots of redundant and conflicting data exists. The performance of a classification algorithm in data mining is greatly affected by noisy information (i.e. redundant and conflicting data). These parameters not only increase the cost of mining process, but also degrade the detection performance of the classifiers. They have to be removed to increase the efficiency and accuracy of the classifiers. This process is called as the tuning of the dataset. The redundancy check will be performed on the original dataset and the resultant is to be preserved. This resultant dataset is to be then checked for conflicting data and if they will be corrected and updated to the original dataset. This updated dataset is to be then classified using a variety of classifiers like Multilayer perceptron, SVM, Decision stump, Kstar, LWL, Rep tree, Decision table, ID3, J48 and Naïve Bayes. The performance of the updated datasets on these classifiers is to be found. The results will show a significant improvement in the classification accuracy when redundancy and conflicts are to be removed. The conflicts after correction ate be updated to the original dataset, and when the performance of the classifier is to be evaluated, great improvement is to be witnessed.
Keywords :
Bayes methods; data mining; decision tables; multilayer perceptrons; support vector machines; ID3; J48; Kstar; LWL; Rep tree; SVM; classification algorithm; data mining; data preprocessing; data tuner; dataset tuning; decision stump; decision table; multilayer perceptron; naive Bayes method; redundancy check; Cleaning; Data mining; Humans; Noise measurement; classification algorithm; data mining; redundancy conflicting data;
Conference_Titel :
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location :
Nagapattinam, Tamil Nadu
Print_ISBN :
978-1-4673-0213-5