Title :
Application of Random-SMOTE on Imbalanced Data Mining
Author :
Li, Jia ; Li, Hui ; Yu, Jun-Ling
Author_Institution :
Sch. of Econ. & Manage., Zhejiang Normal Univ., Jinhua, China
Abstract :
The performance of many classifiers based on balanced data sets can´t do well in imbalanced data sets. This article integrates the over-sampling method of Random-SMOTE (R-S), which is based on SMOTE method, in imbalanced data mining. We use the R-S method to increase the number of the minority randomly in the minority sample space until it is almost equal to the majority in data mining tasks. 5 UCI imbalanced data sets are balanced with the integrated data mining process. Log it algorithm is used for classification with these data sets. The result shows that the integrated use of R-S in data mining can improve the performance of the classifier significantly.
Keywords :
data mining; pattern classification; classification; imbalanced data mining; imbalanced data sets; minority sample space; over-sampling method; random-SMOTE; Accuracy; Classification algorithms; Data mining; Forecasting; Glass; Predictive models; Sampling methods; Data Mining; Imbalaced Data set; Integrated use of Random-SMOTE and logit;
Conference_Titel :
Business Intelligence and Financial Engineering (BIFE), 2011 Fourth International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4577-1541-9
DOI :
10.1109/BIFE.2011.25