• DocumentCode
    2903424
  • Title

    Application of Random-SMOTE on Imbalanced Data Mining

  • Author

    Li, Jia ; Li, Hui ; Yu, Jun-Ling

  • Author_Institution
    Sch. of Econ. & Manage., Zhejiang Normal Univ., Jinhua, China
  • fYear
    2011
  • fDate
    17-18 Oct. 2011
  • Firstpage
    130
  • Lastpage
    133
  • Abstract
    The performance of many classifiers based on balanced data sets can´t do well in imbalanced data sets. This article integrates the over-sampling method of Random-SMOTE (R-S), which is based on SMOTE method, in imbalanced data mining. We use the R-S method to increase the number of the minority randomly in the minority sample space until it is almost equal to the majority in data mining tasks. 5 UCI imbalanced data sets are balanced with the integrated data mining process. Log it algorithm is used for classification with these data sets. The result shows that the integrated use of R-S in data mining can improve the performance of the classifier significantly.
  • Keywords
    data mining; pattern classification; classification; imbalanced data mining; imbalanced data sets; minority sample space; over-sampling method; random-SMOTE; Accuracy; Classification algorithms; Data mining; Forecasting; Glass; Predictive models; Sampling methods; Data Mining; Imbalaced Data set; Integrated use of Random-SMOTE and logit;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Business Intelligence and Financial Engineering (BIFE), 2011 Fourth International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4577-1541-9
  • Type

    conf

  • DOI
    10.1109/BIFE.2011.25
  • Filename
    6121105