DocumentCode :
3239442
Title :
Overproduce-and-select: The grim reality
Author :
Johansson, Ulf ; Lofstrom, Tuve ; Bostrom, Henrik
Author_Institution :
Sch. of Bus., Univ. of Boras, Boras, Sweden
fYear :
2013
fDate :
16-19 April 2013
Firstpage :
52
Lastpage :
59
Abstract :
Overproduce-and-select (OPAS) is a frequently used paradigm for building ensembles. In static OPAS, a large number of base classifiers are trained, before a subset of the available models is selected to be combined into the final ensemble. In general, the selected classifiers are supposed to be accurate and diverse for the OPAS strategy to result in highly accurate ensembles, but exactly how this is enforced in the selection process is not obvious. Most often, either individual models or ensembles are evaluated, using some performance metric, on available and labeled data. Naturally, the underlying assumption is that an observed advantage for the models (or the resulting ensemble) will carry over to test data. In the experimental study, a typical static OPAS scenario, using a pool of artificial neural networks and a number of very natural and frequently used performance measures, is evaluated on 22 publicly available data sets. The discouraging result is that although a fairly large proportion of the ensembles obtained higher test set accuracies, compared to using the entire pool as the ensemble, none of the selection criteria could be used to identify these highly accurate ensembles. Despite only investigating a specific scenario, we argue that the settings used are typical for static OPAS, thus making the results general enough to question the entire paradigm.
Keywords :
learning (artificial intelligence); neural nets; pattern classification; set theory; artificial neural networks; base classifier training; ensembles; overproduce-and-select strategy; performance measures; performance metric; publicly available data sets; selection criteria; static OPAS strategy; Accuracy; Correlation; Diversity reception; Educational institutions; Measurement; Standards; Training; Overproduce-and-select ensembles; neural networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Ensemble Learning (CIEL), 2013 IEEE Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/CIEL.2013.6613140
Filename :
6613140
Link To Document :
بازگشت