Title :
One-Class versus Binary Classification: Which and When?
Author :
Bellinger, C. ; Sharma, Shantanu ; Japkowicz, Nathalie
Author_Institution :
SITE, Univ. of Ottawa, Ottawa, ON, Canada
Abstract :
Binary classifiers have typically been the norm for building classification models in the Machine Learning community. However, an alternate to binary classification is one-class classification, which aims to build models using only a single class of data. This is particularly useful when there is an over-abundance of data of a particular class. In such imbalanced cases, binary classifiers may not perform very well, and one-class classifiers then become the viable option. In this paper, we are interested in investigating the performance of binary and one-class classifiers as the level of imbalance increases, and, thus, uncertainty in the second class. Our objective is to gain insight into which classification paradigm becomes more suitable as imbalance and uncertainty increase. To this end, we conduct experiments on various datasets, both artificial and from the UCI repository, and monitor the performance of the binary and one-class classifiers as the size of the second class gradually decreases, thus increasing the level of imbalance. The results show that as the level of imbalance increases, the performance of binary classifiers decreases, whereas one-class classifiers stay relatively stable.
Keywords :
learning (artificial intelligence); pattern classification; uncertainty handling; UCI dataset; binary classification; imbalanced data; machine learning; one-class classification; uncertainty handling; Data models; Diabetes; Diseases; Heart; Market research; Probability density function; Support vector machines; Machine learning; binary classification; imbalanced data; one-class classification;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
DOI :
10.1109/ICMLA.2012.212