Title :
The effect of class imbalance on case selection for case-based classifiers, with emphasis on computer-aided diagnosis systems
Author :
Malof, Jordan M. ; Mazurowski, Maciej A. ; Tourassi, Georgia D.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Louisville, Louisville, KY, USA
Abstract :
In this paper, the effect of class imbalance in the case base of a case-based classifier is investigated as it pertains to case base reduction and the resulting classifier performance. A k-nearest neighbor algorithm is used as a classifier and the random mutation hill climbing (RMHC) algorithm is used for case base reduction. The effects at various levels of positive class prevalence are tested in a binary classification problem. The results indicate that class imbalance is detrimental to both case base reduction and classifier performance. Selection with RMHC generally improves the classification performance regardless of the case base prevalence.
Keywords :
case-based reasoning; medical diagnostic computing; pattern classification; random processes; binary classification; case base reduction; case-based classifier; class imbalance; computer-aided diagnosis system; k-nearest neighbor algorithm; random mutation hill climbing; Biomedical imaging; Computational efficiency; Computer aided diagnosis; Computer networks; Delay; Genetic mutations; Image storage; Machine learning; Medical diagnostic imaging; Neural networks; Cased-Based Learning; Computer-Aided Decision; Imbalance;
Conference_Titel :
Neural Networks, 2009. IJCNN 2009. International Joint Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-3548-7
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2009.5178759