DocumentCode :
2903424
Title :
Application of Random-SMOTE on Imbalanced Data Mining
Author :
Li, Jia ; Li, Hui ; Yu, Jun-Ling
Author_Institution :
Sch. of Econ. & Manage., Zhejiang Normal Univ., Jinhua, China
fYear :
2011
fDate :
17-18 Oct. 2011
Firstpage :
130
Lastpage :
133
Abstract :
Many classifiers designed for balanced data sets perform poorly on imbalanced data sets. This article integrates Random-SMOTE (R-S), an over-sampling method based on SMOTE, into imbalanced data mining. We use the R-S method to increase the number of minority samples randomly within the minority sample space until it is almost equal to the number of majority samples. Five UCI imbalanced data sets are balanced with the integrated data mining process. The logit algorithm is used for classification on these data sets. The results show that the integrated use of R-S in data mining can improve the performance of the classifier significantly.
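The abstract does not spell out how R-S generates samples, so the following is only a minimal sketch under an assumed scheme: for a base minority sample, two other minority samples are drawn at random, a temporary point is taken on the segment between them, and the synthetic sample is taken on the segment from the base sample to that temporary point. The function names `random_smote_sketch` and `balance` are hypothetical and not from the paper.

```python
import random


def random_smote_sketch(minority, n_new, seed=0):
    """Generate n_new synthetic points inside the minority sample space.

    Assumed scheme (not given in the abstract): pick a base sample x and two
    other minority samples y1, y2 at random, take a random point t on the
    segment y1-y2, then take a random point on the segment x-t.
    """
    if len(minority) < 3:
        raise ValueError("need at least three minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        # Three distinct minority samples: the base point and two partners.
        x, y1, y2 = rng.sample(minority, 3)
        a, b = rng.random(), rng.random()
        # Temporary point on the segment between y1 and y2.
        t = [p + a * (q - p) for p, q in zip(y1, y2)]
        # Synthetic point on the segment between x and the temporary point.
        synthetic.append([p + b * (q - p) for p, q in zip(x, t)])
    return synthetic


def balance(minority, majority, seed=0):
    """Over-sample the minority class until it matches the majority size."""
    n_new = max(0, len(majority) - len(minority))
    return minority + random_smote_sketch(minority, n_new, seed)
```

Because every synthetic point is a convex combination of three minority samples, it stays inside the region spanned by the minority class, which is one reading of "randomly in the minority sample space". The balanced minority set, together with the majority samples, would then be passed to a logit classifier.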
Keywords :
data mining; pattern classification; classification; imbalanced data mining; imbalanced data sets; minority sample space; over-sampling method; random-SMOTE; Accuracy; Classification algorithms; Data mining; Forecasting; Glass; Predictive models; Sampling methods; Data Mining; Imbalanced Data set; Integrated use of Random-SMOTE and logit
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Business Intelligence and Financial Engineering (BIFE), 2011 Fourth International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4577-1541-9
Type :
conf
DOI :
10.1109/BIFE.2011.25
Filename :
6121105