Title :
On the duel between the top 10 percent classifiers versus the bottom 90 percent classifiers
Author :
Chang, Kung-Hua ; Parker, D. Stott
Author_Institution :
Comput. Sci. Dept., UCLA, Los Angeles, CA, USA
Abstract :
Over the past two decades a number of methods have been proposed for constructing stronger classifiers as an ensemble of simpler individual classifiers. These simpler classifiers are often limited to a single family, and candidates among these are further limited to only elite classifiers, i.e., classifiers having highest performance. Many prior research papers suggest that using the top 10%-20% of elite classifiers in an ensemble should yield stronger classifiers. In this paper, we have explored the top 10% of elite classifiers as well as the bottom 90% of non-elite classifiers, proposed algorithms that can effectively use them to construct much stronger classifiers, and showed that using the bottom 90% can yield comparable and sometimes better performance than using the top 10%. Our algorithm is based on heuristic search algorithms for developing ensembles of diverse classifiers that optimize complementarity among them, and the experiments are made with 9 publicly available datasets using 22 different kinds of classifiers.
Keywords :
data mining; pattern classification; search problems; bottom 90 percent classifiers; data mining; diverse classifier ensemble; heuristic search algorithms; nonelite classifiers; stronger classifiers; top 10 percent classifiers; Accuracy; Classification algorithms; Computational fluid dynamics; Ionosphere; Iris; Support vector machine classification; Training; Data Mining; Heuristic algorithms; Machine learning algorithms;
Conference_Titel :
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4673-0381-1
Electronic_ISBN :
978-1-4673-0380-4
DOI :
10.1109/ISSPA.2012.6310548