Title :
Comparing pure parallel ensemble creation techniques against bagging
Author :
Hall, Lawrence O. ; Bowyer, KevinW ; Banfield, Robert E. ; Bhadoria, Divya ; Kegelmeyer, W. Philip ; Eschrich, Steven
Author_Institution :
Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
Abstract :
We experimentally evaluate randomization-based approaches to creating an ensemble of decision-tree classifiers. Unlike methods related to boosting, all of the eight approaches considered here create each classifier in an ensemble independently of the other classifiers. Experiments were performed on 28 publicly available datasets, using C4.5 release 8 as the base classifier. While each of the other seven approaches has some strengths, we find that none of them is consistently more accurate than standard bagging when tested for statistical significance.
Keywords :
bagging; decision trees; learning (artificial intelligence); pattern classification; random processes; statistical testing; bagging; boosting; decision-tree classifier; parallel ensemble creation techniques; randomization; statistical testing; training datasets; Bagging; Boosting; Computer science; Data mining; Decision trees; Testing;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250970