DocumentCode :
2914236
Title :
A clustering-based decision tree induction algorithm
Author :
Barros, Rodrigo C. ; De Carvalho, André C P L F ; Basgalupp, Márcio P. ; Quiles, Marcos G.
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo (USP), São Carlos, Brazil
fYear :
2011
fDate :
22-24 Nov. 2011
Firstpage :
543
Lastpage :
550
Abstract :
Decision tree induction algorithms are well known techniques for assigning objects to predefined categories in a transparent fashion. Most decision tree induction algorithms rely on a greedy top-down recursive strategy for growing the tree, and pruning techniques to avoid overfitting. Even though such a strategy has been quite successful in many problems, it falls short in several others. For instance, there are cases in which the hyper-rectangular surfaces generated by these algorithms can only map the problem description after several sub-sequential partitions, which results in a large and incomprehensible tree. Hence, we propose a new decision tree induction algorithm based on clustering which seeks to provide more accurate models and/or shorter descriptions more comprehensible for the end-user. We do not base our performance analysis solely on the straightforward comparison of our proposed algorithm to baseline methods. Instead, we propose a data-dependent analysis in order to look for evidences which may explain in which situations our algorithm outperforms a well-known decision tree induction algorithm.
Keywords :
data analysis; decision trees; learning (artificial intelligence); pattern clustering; clustering-based decision tree induction algorithm; data-dependent analysis; greedy top-down recursive strategy; hyper-rectangular surfaces; pruning techniques; transparent fashion; Accuracy; Algorithm design and analysis; Clustering algorithms; Complexity theory; Decision trees; Machine learning algorithms; Training; clustering; data-dependency analysis; decision trees; hybrid intelligent systems; machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
ISSN :
2164-7143
Print_ISBN :
978-1-4577-1676-8
Type :
conf
DOI :
10.1109/ISDA.2011.6121712
Filename :
6121712
Link To Document :
بازگشت