DocumentCode :
2811369
Title :
Local cost sensitive learning for handling imbalanced data sets
Author :
Karagiannopoulos, M.G. ; Anyfantis, D.S. ; Kotsiantis, S.B. ; Pintelas, P.E.
Author_Institution :
Univ. of Patras, Patras
fYear :
2007
fDate :
27-29 June 2007
Firstpage :
1
Lastpage :
6
Abstract :
Many real-world data sets exhibit skewed class distributions in which almost all cases are allotted to a class and far fewer cases to a smaller, usually more interesting class. A classifier induced from an imbalanced data set has, typically, a low error rate for the majority class and an unacceptable error rate for the minority class. This paper firstly provides a systematic study on the various methodologies that have tried to handle this problem. Finally, it presents an experimental study of these methodologies with a proposed local cost sensitive technique and it concludes that such a framework can be a more effective solution to the problem. Our method seems to allow improved identification of difficult small classes in predictive analysis, while keeping the classification ability of the other classes in an acceptable level.
Keywords :
data handling; learning (artificial intelligence); pattern classification; class identification; data classifier; imbalanced data set handling; local cost sensitive learning; predictive analysis; skewed class distribution; supervised machine learning; Bayesian methods; Classification tree analysis; Costs; Decision trees; Error analysis; Laboratories; Machine learning; Mathematics; Telephony; Training data; imbalanced data sets; local learning; supervised machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control & Automation, 2007. MED '07. Mediterranean Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4244-1282-2
Electronic_ISBN :
978-1-4244-1282-2
Type :
conf
DOI :
10.1109/MED.2007.4433808
Filename :
4433808
Link To Document :
بازگشت