Title :
Weighted conditional mutual information based boosting for classification of imbalanced datasets
Author_Institution :
Comput. Autom. Res. Inst., Budapest, Hungary
Abstract :
This paper addresses the problem of binary classifier learning when the training data is imbalanced, i.e. the samples of the two classes have significantly different cardinality. We investigate two different cost-sensitive approaches in the conditional mutual information (CMI) based weak classifier selection procedure using histogram descriptors. The first method uses CMI for classifier selection, and cost factors are utilized in the construction of the final boosted classifier using support vector machine learning. In the second approach these costs are incorporated into the classifier selection step by weighting the CMI (wCMI). We evaluate the proposed methods in object recognition and detection tasks using two popular histogram-like descriptors. Extensive experiments showed that the proposed methods provide efficient tools to address both problems.
Keywords :
image classification; learning (artificial intelligence); object detection; object recognition; support vector machines; CMI-based weak classifier selection procedure; binary classifier learning; boosted classifier; conditional mutual information-based weak classifier selection procedure; cost-sensitive approaches; histogram descriptors; histogram-like descriptors; imbalanced dataset classification; object detection; object recognition; support vector machine learning; training data; weighted conditional mutual information-based boosting; Boosting; Face; Feature extraction; Histograms; Mutual information; Testing; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4