DocumentCode :
1240239
Title :
An experimental bias-variance analysis of SVM ensembles based on resampling techniques
Author :
Valentini, Giorgio
Author_Institution :
DSI-Dipt. di Sci. dell´´Informazione, Univ. degli Studi di Milano, Italy
Volume :
35
Issue :
6
fYear :
2005
Firstpage :
1252
Lastpage :
1271
Abstract :
Recently, bias-variance decomposition of error has been used as a tool to study the behavior of learning algorithms and to develop new ensemble methods well suited to the bias-variance characteristics of base learners. We propose methods and procedures, based on Domingo´s unified bias-variance theory, to evaluate and quantitatively measure the bias-variance decomposition of error in ensembles of learning machines. We apply these methods to study and compare the bias-variance characteristics of single support vector machines (SVMs) and ensembles of SVMs based on resampling techniques, and their relationships with the cardinality of the training samples. In particular, we present an experimental bias-variance analysis of bagged and random aggregated ensembles of SVMs in order to verify their theoretical variance reduction properties. The experimental bias-variance analysis quantitatively characterizes the relationships between bagging and random aggregating, and explains the reasons why ensembles built on small subsamples of the data work with large databases. Our analysis also suggests new directions for research to improve on classical bagging.
Keywords :
covariance analysis; sampling methods; support vector machines; SVM ensembles; bias-variance analysis; large databases; learning algorithm; random aggregating; resampling techniques; support vector machines; Analysis of variance; Bagging; Data analysis; Databases; Decision trees; Kernel; Machine learning; Statistical analysis; Stochastic processes; Support vector machines; Bagging; bias-variance analysis; ensemble of learning machines; support vector machines; Algorithms; Analysis of Variance; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Models, Statistical; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2005.850183
Filename :
1542270
Link To Document :
بازگشت