DocumentCode :
2063355
Title :
Comparing SVM ensembles for imbalanced datasets
Author :
Bhatnagar, Vasudha ; Bhardwaj, Manju ; Mahabal, Ashish
Author_Institution :
Dept. of Comput. Sci., Delhi Univ., Delhi, India
fYear :
2010
fDate :
Nov. 29 2010-Dec. 1 2010
Firstpage :
651
Lastpage :
657
Abstract :
Real life datasets often suffer from the problem of class imbalance, which thwarts supervised learning process. In such data sets examples of positive (minority) class are significantly less than those of negative (majority) class leading to severe class imbalance. Constructing high quality classifiers for such imbalanced training data sets is one of the major challenges in machine learning, since traditional classification algorithms tend to get biased towards majority class. In this paper, we compare three ensemble based approaches for handling imbalanced datasets. All the three approaches aim to overcome the under representation of minority class by learning from each of the minority class samples and a subset of majority class samples. The three approaches namely, PARTEN, UMjC and LFM were evaluated on several public datasets. Precision, recall, F- measure, g-mean and ROC space measures were used for comparison. Thread-bare discussion of the results is presented in the paper. Subsequently, we present an astronomy application, where the three methods are compared for prediction of class II, IIn and IIp supernovae.
Keywords :
data handling; learning (artificial intelligence); pattern classification; support vector machines; F-measure; LFM; PARTEN; ROC space measure; SVM ensemble; UMjC; high quality classifier; imbalanced dataset handling; imbalanced training data set; machine learning; supervised learning; Class imbalance; Classification; Ensembles; SVM; Supervised Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location :
Cairo
Print_ISBN :
978-1-4244-8134-7
Type :
conf
DOI :
10.1109/ISDA.2010.5687191
Filename :
5687191
Link To Document :
بازگشت