DocumentCode
1758998
Title
A Double Pruning Scheme for Boosting Ensembles
Author
Soto, Victor ; Garcia-Moratilla, Sergio ; Martinez-Munoz, Gonzalo ; Hernandez-Lobato, Daniel ; Suarez, Almudena
Author_Institution
Escuela Politec. Super., Univ. Autonoma de Madrid, Cantoblanco, Spain
Volume
44
Issue
12
fYear
2014
fDate
Dec. 2014
Firstpage
2682
Lastpage
2695
Abstract
Ensemble learning consists of generating a collection of classifiers whose predictions are then combined to yield a single unified decision. Ensembles of complementary classifiers provide accurate and robust predictions, which are often better than the predictions of the individual classifiers in the ensemble. Nevertheless, ensembles also have some drawbacks: typically, all classifiers are queried to compute the final ensemble prediction. Therefore, all the classifiers need to be accessible to address potential queries. This entails larger storage requirements and slower predictions than a single classifier. Ensemble pruning techniques are useful to alleviate these drawbacks. Static pruning techniques reduce the ensemble size by selecting a sub-ensemble of classifiers from the original ensemble. In dynamic pruning, the querying process is halted when the partial ensemble prediction is sufficient to reach a stable final decision with a reasonable amount of confidence. In this paper, we present the results of a comprehensive analysis of static and dynamic pruning techniques applied to Adaboost ensembles. These ensemble pruning techniques are evaluated on a wide range of classification problems. From this analysis, one concludes that the combination of static and dynamic pruning techniques provides a notable reduction in the memory requirements and an improvement in the classification time without a significant loss of prediction accuracy.
Keywords
learning (artificial intelligence); pattern classification; query processing; storage management; Adaboost ensembles; boosting ensembles; classification problems; classification time; complementary classifiers; double pruning scheme; dynamic pruning techniques; ensemble learning; ensemble prediction; ensemble pruning techniques; ensemble size; memory requirements; querying process; static pruning techniques; storage requirements; Accuracy; Boosting; Heuristic algorithms; Memory management; Prediction algorithms; Training; Vectors; Adaboost; double pruning; ensemble pruning; instance-based pruning; semi-definite programming;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2014.2313638
Filename
6805606
Link To Document