Title :
A Theoretical Analysis of Bagging as a Linear Combination of Classifiers
Author :
Fumera, Giorgio ; Roli, F. ; Serrau, A.
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Cagliari, Cagliari
fDate :
7/1/2008 12:00:00 AM
Abstract :
We apply an analytical framework for the analysis of linearly combined classifiers to ensembles generated by bagging. This provides an analytical model of bagging misclassification probability as a function of the ensemble size, which is a novel result in the literature. Experimental results on real data sets confirm the theoretical predictions. This allows us to derive a novel and theoretically grounded guideline for choosing bagging ensemble size. Furthermore, our results are consistent with explanations of bagging in terms of classifier instability and variance reduction, support the optimality of the simple average over the weighted average combining rule for ensembles generated by bagging, and apply to other randomization-based methods for constructing classifier ensembles. Although our results do not allow to compare bagging misclassification probability with the one of an individual classifier trained on the original training set, we discuss how the considered theoretical framework could be exploited to this aim.
Keywords :
pattern classification; probability; random processes; bagging misclassification probability; classifier instability; linearly combined classifier; randomization-based method; variance reduction; Bagging; Classifier Fusion; Linear Combiners; Multiple Classifier Systems; pattern classification.; Algorithms; Artificial Intelligence; Cluster Analysis; Computer Simulation; Linear Models; Pattern Recognition, Automated;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2008.30