• DocumentCode
    2811369
  • Title

    Local cost sensitive learning for handling imbalanced data sets

  • Author

    Karagiannopoulos, M.G. ; Anyfantis, D.S. ; Kotsiantis, S.B. ; Pintelas, P.E.

  • Author_Institution
    Univ. of Patras, Patras
  • fYear
    2007
  • fDate
    27-29 June 2007
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Many real-world data sets exhibit skewed class distributions in which almost all cases are allotted to a class and far fewer cases to a smaller, usually more interesting class. A classifier induced from an imbalanced data set has, typically, a low error rate for the majority class and an unacceptable error rate for the minority class. This paper firstly provides a systematic study on the various methodologies that have tried to handle this problem. Finally, it presents an experimental study of these methodologies with a proposed local cost sensitive technique and it concludes that such a framework can be a more effective solution to the problem. Our method seems to allow improved identification of difficult small classes in predictive analysis, while keeping the classification ability of the other classes in an acceptable level.
  • Keywords
    data handling; learning (artificial intelligence); pattern classification; class identification; data classifier; imbalanced data set handling; local cost sensitive learning; predictive analysis; skewed class distribution; supervised machine learning; Bayesian methods; Classification tree analysis; Costs; Decision trees; Error analysis; Laboratories; Machine learning; Mathematics; Telephony; Training data; imbalanced data sets; local learning; supervised machine learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control & Automation, 2007. MED '07. Mediterranean Conference on
  • Conference_Location
    Athens
  • Print_ISBN
    978-1-4244-1282-2
  • Electronic_ISBN
    978-1-4244-1282-2
  • Type

    conf

  • DOI
    10.1109/MED.2007.4433808
  • Filename
    4433808