DocumentCode :
2985128
Title :
ConfDTree: Improving Decision Trees Using Confidence Intervals
Author :
Katz, Gil ; Shabtai, Asaf ; Rokach, L. ; Ofek, N.
Author_Institution :
Dept. of Inf. Syst. Eng., Ben-Gurion Univ. of the Negev, Beer-Sheva, Israel
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
339
Lastpage :
348
Abstract :
Decision trees have three main disadvantages: reduced performance when the training set is small, rigid decision criteria and the fact that a single "uncharacteristic" attribute might "derail" the classification process. In this paper we present ConfDTree - a post-processing method which enables decision trees to better classify outlier instances. This method, which can be applied on any decision trees algorithm, uses confidence intervals in order to identify these hard-to-classify instances and proposes alternative routes. The experimental study indicates that the proposed post-processing method consistently and significantly improves the predictive performance of decision trees, particularly for small, imbalanced or multi-class datasets in which an average improvement of 5%-9% in the AUC performance is reported.
Keywords :
decision trees; pattern classification; set theory; AUC performance; ConfDTree; classification process; confidence intervals; decision criteria; decision trees algorithm; hard-to-classify instances; multiclass datasets; post-processing method; training set; Classification algorithms; Decision trees; Gaussian distribution; Prediction algorithms; Standards; Training; Vegetation; confidence intervals; decision trees; imbalanced datasets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.19
Filename :
6413889
Link To Document :
بازگشت