• DocumentCode
    607366
  • Title

    A modified error function for imbalanced dataset classification problem

  • Author

    Vorraboot, P. ; Rasmequan, Suwanna ; Lursinsap, C. ; Chinnasarn, Krisana

  • Author_Institution
    Fac. of Inf., Burapha Univ., Chonburi, Thailand
  • fYear
    2012
  • fDate
    3-5 Dec. 2012
  • Firstpage
    854
  • Lastpage
    859
  • Abstract
    The objective of learning is to achieve the least error rate. In this paper we proposed a modified cost function as a means to properly measure error rate for imbalanced dataset. Most cost functions apply the same weights to all classes. However, it has been known that for imbalanced problem, the number of instances in the majority class is larger than the minority class. Therefore, the application of equal weight to all classes will significantly lead to improper classification boundary. That is, for most learning model, the minority class would be dominated by majority class which then causes a misclassification on the minority class. The objective of this paper is to find the appropriate parameters to improve MSE cost function based on overlap ratio and class distribution ratio. Back-propagation algorithm with the proposed modified cost function is used to solve two-class classification problem. UCI datasets are used for the experimentation. The results show that the modified MSE cost function provides a better result than the standard one, based on True-positive rate, G-Mean, and F-measurement.
  • Keywords
    backpropagation; mean square error methods; pattern classification; F-measurement; G-Mean; MSE cost function; UCI datasets; back-propagation algorithm; classification boundary; imbalanced dataset classification problem; least error rate; modified cost function; modified error function; true-positive rate; classification; error function; imbalanced dataset classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-0894-6
  • Type

    conf

  • Filename
    6530455