Title :
An Analysis of Properties of Malignant Cases for Imbalanced Breast Thermogram Feature Classification
Author :
Krawczyk, Bartosz ; Schaefer, Gerald
Author_Institution :
Dept. of Syst. & Comput. Networks, Wroclaw Univ. of Technol., Wrocław, Poland
Abstract :
Medical thermography has been demonstrated to be an effective and inexpensive method for detecting breast cancer, in particular for tumors in early stages and in dense tissue. Image features can be extracted from breast thermograms and used in a pattern classification stage for automated diagnosis, and hence serve as an objective second opinion or for screening purposes. One of the main challenges in applying machine learning algorithms to this task is the high imbalance ratio between class distributions in the available training data. In this paper, we carefully examine the properties of the malignant minority class in order to gain insight into the nature of the data. We identify different types of minority class samples present in a breast thermogram dataset comprising about 150 cases. Using the gained knowledge, we analyse the performance of three state-of-the-art ensemble classifiers (a cost-sensitive one, one based on over-sampling, and one using under-sampling) to evaluate which objects are the most difficult to classify correctly. Experimental analysis shows that there is a strong correlation between the type of minority sample and the performance of specific classifier ensemble types.
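The abstract does not state how minority samples are assigned to types, so the following is only an illustrative sketch, not the authors' procedure. It assumes a common neighbourhood-based scheme from the imbalanced-learning literature: each minority sample is labelled safe, borderline, rare, or outlier according to how many of its k = 5 nearest neighbours share its class. The function names, the thresholds, and the toy data are all assumptions introduced here.

```python
import math
from collections import Counter


def imbalance_ratio(labels, minority=1):
    """Return the number of majority samples per minority sample."""
    counts = Counter(labels)
    n_min = counts[minority]
    n_maj = len(labels) - n_min
    return n_maj / n_min


def minority_types(X, y, minority=1, k=5):
    """Label each minority sample by the class mix of its k nearest neighbours.

    Assumed thresholds (chosen for k = 5, hypothetical for this dataset):
    4-5 same-class neighbours -> safe, 2-3 -> borderline,
    1 -> rare, 0 -> outlier.
    """
    types = {}
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != minority:
            continue
        # Euclidean distance from sample i to every other sample.
        dists = sorted((math.dist(xi, xj), j) for j, xj in enumerate(X) if j != i)
        same = sum(1 for _, j in dists[:k] if y[j] == minority)
        if same >= 4:
            types[i] = "safe"
        elif same >= 2:
            types[i] = "borderline"
        elif same == 1:
            types[i] = "rare"
        else:
            types[i] = "outlier"
    return types


if __name__ == "__main__":
    # Toy 2-D feature vectors, not thermogram data.
    X = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5), (10.4, 10.6),
         (10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5),
         (12, 12), (9, 9), (12, 10)]
    y = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
    print(imbalance_ratio(y))
    print(minority_types(X, y))
```

Grouping the per-sample errors of each ensemble by these labels is one way to examine which kinds of malignant cases are hardest to classify, which is the kind of analysis the abstract describes.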
Keywords :
biomedical optical imaging; cancer; feature extraction; image classification; infrared imaging; learning (artificial intelligence); medical image processing; object detection; tumours; automated diagnosis; breast cancer detection; class distributions; classifier ensemble types; dense tissue; ensemble classifiers; high imbalance ratio; image feature extraction; imbalanced breast thermogram feature classification; machine learning algorithms; malignant cases; malignant minority class; medical thermography; pattern classification; screening purposes; tumors; Accuracy; Breast cancer; Pattern recognition; Support vector machines; ensemble classifier; imbalanced classification; medical data analysis; multiple classifier system
Conference_Title :
Pattern Recognition (ACPR), 2013 2nd IAPR Asian Conference on
Conference_Location :
Naha
DOI :
10.1109/ACPR.2013.45