Title :
Adaptive selection of ensembles for imbalanced class distributions
Author :
Radtke, P.V.W. ; Granger, E. ; Sabourin, R. ; Gorodnichy, Dmitry
Author_Institution :
Lab. d´´Imagerie, de Vision et d´´Intell. Artificielle - LIVIA, Univ. du Quebec, Montreal, QC, Canada
Abstract :
Boolean combination (BC) techniques have been shown to efficiently integrate the responses of multiple diversified classifiers in the ROC space to improve the overall accuracy and reliability of pattern recognition systems. In practice, since class distributions are often imbalanced and change over time, the BC of classifiers, and thus selection of ensembles, should be adapted to reflect operational conditions. Although the impact on classification performance of imbalanced distributions may be addressed using ensemble-based techniques, this is difficult to observe from ROC curves. However, given a desired false positive rate and class imbalance, performing BC in the Precision-Recall Operating Characteristic (PROC) space with skewed data may lead to a higher level of performance. In this paper, an adaptive system is proposed that initially generates several PROC curves, each one from data with a different level of skew. Then, during operations, the class imbalance is periodically estimated, and used to approximate the most accurate BC of classifiers among operational points of these curves. Simulation results indicate that this approach maintains a high level of accuracy that is comparable to full Boolean re-combination (as required for a specific level of imbalance), but for a significantly lower computational cost.
Keywords :
Boolean algebra; pattern classification; reliability; sensitivity analysis; Boolean combination techniques; Boolean recombination; PROC curves; PROC space; ROC curves; ROC space; adaptive ensemble selection; classification performance; classifier BC; ensemble-based techniques; false positive rate; imbalanced class distributions; multiple diversified classifiers; pattern recognition system reliability; precision-recall operating characteristic space; skewed data; Accuracy; Approximation algorithms; Face recognition; Receivers; Reliability; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4