Title :
Pruning in Ordered Regression Bagging Ensembles
Author :
Hernández-Lobato, Daniel ; Martínez-Muñoz, Gonzalo ; Suárez, Alberto
Author_Institution :
Univ. Autonoma de Madrid, Madrid
Abstract :
An efficient procedure for pruning regression ensembles is introduced. Starting from a bagging ensemble, pruning proceeds by ordering the regressors in the original ensemble and then selecting a subset for aggregation. Ensembles of increasing size are built by including first the regressors that perform best when aggregated. This strategy gives an approximate solution to the problem of extracting from the original ensemble the minimum error subensemble, which we prove to be NP-hard. Experiments show that pruned ensembles with only 20% of the initial regressors achieve better generalization accuracies than the complete bagging ensembles. The performance of pruned ensembles is analyzed by means of the bias-variance decomposition of the error.
Keywords :
optimisation; regression analysis; NP-hard; minimum error subensemble; ordered regression bagging ensemble pruning; Algorithm design and analysis; Bagging; Computer science; Greedy algorithms; Performance analysis; Polynomials; Reflection; Sampling methods; Training data;
Conference_Titel :
Neural Networks, 2006. IJCNN '06. International Joint Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9490-9
DOI :
10.1109/IJCNN.2006.246837