Title :
Comparing the knowledge quality in rough classifier and decision tree classifier
Author :
Mohsin, Mohamad Farhan Mohamad ; Wahab, Mohd Helmy Abd
Author_Institution :
College of Arts & Sciences, Universiti Utara Malaysia, 06010 UUM Sintok, Kedah, Malaysia
Abstract :
This paper presents a comparative study of two rule based classifier; rough set (Rc) and decision tree (DTc). Both techniques apply different approach to perform classification but produce same structure of output with comparable result. Theoretically, different classifiers will generate different sets of rules via knowledge even though they are implemented to the same classification problem. Hence, the aim of this paper is to investigate the quality of knowledge produced by Rc and DTc when similar problems are presented to them. In this case, four important performance metrics are used as comparison, the accuracy of classification, rules quantity, rules length and rules coverage. Five dataset from UCI Machine Learning are chosen and then mined using Rc toolkit namely ROSETTA while C4.5 algorithm in WEKA application is chosen as DTc rule generator. The experimental result shows that Rc and DTc own capability to generate quality knowledge since most of the results are comparable. Rc outperform as an accurate classifier, produce shorter and simpler rule with higher coverage. Meanwhile, DTc obviously generates fewer numbers of rules with significant difference.
Keywords :
Art; Classification tree analysis; Data engineering; Data mining; Decision trees; Educational institutions; Machine learning; Machine learning algorithms; Measurement; Testing;
Conference_Titel :
Information Technology, 2008. ITSim 2008. International Symposium on
Conference_Location :
Kuala Lumpur, Malaysia
Print_ISBN :
978-1-4244-2327-9
Electronic_ISBN :
978-1-4244-2328-6
DOI :
10.1109/ITSIM.2008.4631700