DocumentCode
2811369
Title
Local cost sensitive learning for handling imbalanced data sets
Author
Karagiannopoulos, M.G. ; Anyfantis, D.S. ; Kotsiantis, S.B. ; Pintelas, P.E.
Author_Institution
Univ. of Patras, Patras
fYear
2007
fDate
27-29 June 2007
Firstpage
1
Lastpage
6
Abstract
Many real-world data sets exhibit skewed class distributions in which almost all cases are allotted to a class and far fewer cases to a smaller, usually more interesting class. A classifier induced from an imbalanced data set has, typically, a low error rate for the majority class and an unacceptable error rate for the minority class. This paper firstly provides a systematic study on the various methodologies that have tried to handle this problem. Finally, it presents an experimental study of these methodologies with a proposed local cost sensitive technique and it concludes that such a framework can be a more effective solution to the problem. Our method seems to allow improved identification of difficult small classes in predictive analysis, while keeping the classification ability of the other classes in an acceptable level.
Keywords
data handling; learning (artificial intelligence); pattern classification; class identification; data classifier; imbalanced data set handling; local cost sensitive learning; predictive analysis; skewed class distribution; supervised machine learning; Bayesian methods; Classification tree analysis; Costs; Decision trees; Error analysis; Laboratories; Machine learning; Mathematics; Telephony; Training data; imbalanced data sets; local learning; supervised machine learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Control & Automation, 2007. MED '07. Mediterranean Conference on
Conference_Location
Athens
Print_ISBN
978-1-4244-1282-2
Electronic_ISBN
978-1-4244-1282-2
Type
conf
DOI
10.1109/MED.2007.4433808
Filename
4433808
Link To Document