Title :
Multi-pruning of decision trees for knowledge representation and classification
Author :
Mohammad Azad;Igor Chikalov;Shahid Hussain;Mikhail Moshkov
Author_Institution :
King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
Abstract :
We consider two important questions related to decision trees: first how to construct a decision tree with reasonable number of nodes and reasonable number of misclassification, and second how to improve the prediction accuracy of decision trees when they are used as classifiers. We have created a dynamic programming based approach for bi-criteria optimization of decision trees relative to the number of nodes and the number of misclassification. This approach allows us to construct the set of all Pareto optimal points and to derive, for each such point, decision trees with parameters corresponding to that point. Experiments on datasets from UCI ML Repository show that, very often, we can find a suitable Pareto optimal point and derive a decision tree with small number of nodes at the expense of small increment in number of misclassification. Based on the created approach we have proposed a multi-pruning procedure which constructs decision trees that, as classifiers, often outperform decision trees constructed by CART.
Keywords :
"Decision trees","Pareto optimization","Dynamic programming","Knowledge representation","Training","Testing"
Conference_Titel :
Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
Electronic_ISBN :
2327-0985
DOI :
10.1109/ACPR.2015.7486574