Title :
Pruning with Majority and Minority Properties
Author :
Hae Sook Jeon ; Won Don Lee
Author_Institution :
IT Convergence Technol. Res. Lab., Electron. & Telecommun. Res. Inst., Daejeon, South Korea
Abstract :
Classification is very imprtant research in knowledge discovery and machine learning. The decision tree is one of the well-known data mining methods. In general, a decision tree can be grown so as to have zero eeor on the training data set. If there is any noise in the data set or it does not completely cover the decision space, then over-fitting occurs and the tree needs to be pruned in order to accurately generalize the test data set. In this paper, we propose a pre-pruning method with majority and minority properties for the decision tree. It uses two kinds of qualifying criteria to consider whether the ration of the highest class of a subtree is the majority of the subtree or a minority of the overall tree. New measures for these are added to the classifier with the extended data expression. Experiments show that a clasifier using this pruning method can improve classification accuracy as well as reduce the size of the tree.
Keywords :
data mining; decision trees; learning (artificial intelligence); pattern classification; classification accuracy; data expression; data mining methods; decision tree; knowledge discovery; machine learning; pattern classification; pre-pruning method; subtree; Accuracy; Decision trees; Equations; Error analysis; Gain measurement; Rain; Training data;
Conference_Titel :
Information Science and Applications (ICISA), 2014 International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4799-4443-9
DOI :
10.1109/ICISA.2014.6847450