DocumentCode
2903424
Title
Application of Random-SMOTE on Imbalanced Data Mining
Author
Li, Jia ; Li, Hui ; Yu, Jun-Ling
Author_Institution
Sch. of Econ. & Manage., Zhejiang Normal Univ., Jinhua, China
fYear
2011
fDate
17-18 Oct. 2011
Firstpage
130
Lastpage
133
Abstract
Many classifiers designed for balanced data sets perform poorly on imbalanced data sets. This article integrates Random-SMOTE (R-S), an over-sampling method based on SMOTE, into imbalanced data mining. We use the R-S method to randomly generate minority-class samples in the minority sample space until their number is almost equal to that of the majority class. Five imbalanced UCI data sets are balanced with the integrated data mining process, and the logit algorithm is used to classify them. The results show that the integrated use of R-S in data mining can improve the performance of the classifier significantly.
Keywords
data mining; pattern classification; classification; imbalanced data mining; imbalanced data sets; minority sample space; over-sampling method; random-SMOTE; Accuracy; Classification algorithms; Data mining; Forecasting; Glass; Predictive models; Sampling methods; Data Mining; Imbalanced Data set; Integrated use of Random-SMOTE and logit
fLanguage
English
Publisher
IEEE
Conference_Titel
Business Intelligence and Financial Engineering (BIFE), 2011 Fourth International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4577-1541-9
Type
conf
DOI
10.1109/BIFE.2011.25
Filename
6121105
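The abstract describes over-sampling the minority class at random within the minority sample space until it roughly matches the majority class, but it gives no algorithmic detail. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes each synthetic sample is drawn by picking three distinct minority samples, taking a random point on the segment between two of them, and then a random point on the segment from the third sample to that temporary point. The function names `random_smote` and `balance` are hypothetical, and the sketch balances the classes exactly rather than "almost".

```python
import random


def random_smote(minority, n_new, seed=0):
    """Generate n_new synthetic minority samples (hypothetical helper).

    Assumed scheme: pick a base sample x and two other minority samples
    y1, y2; draw a temporary point t on the segment y1-y2, then draw the
    new sample on the segment x-t. Each result is therefore a convex
    combination of three existing minority samples.
    """
    if len(minority) < 3:
        raise ValueError("need at least three minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x, y1, y2 = rng.sample(minority, 3)  # three distinct samples
        a, b = rng.random(), rng.random()    # interpolation weights in [0, 1)
        t = [p + a * (q - p) for p, q in zip(y1, y2)]
        synthetic.append([xi + b * (ti - xi) for xi, ti in zip(x, t)])
    return synthetic


def balance(majority, minority, seed=0):
    """Over-sample the minority class until it matches the majority size."""
    n_new = max(0, len(majority) - len(minority))
    if n_new == 0:
        return list(minority)
    return list(minority) + random_smote(minority, n_new, seed)


if __name__ == "__main__":
    majority = [[float(i), float(i)] for i in range(10)]
    minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    balanced = balance(majority, minority)
    print(len(majority), len(balanced))
```

In a pipeline like the one the abstract outlines, the balanced minority set would be combined with the majority set and passed to a logit classifier; that step is not shown here.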