Title :
Imbalance data classification method based on cluster boundary sampling RF-Bagging
Author :
Peng Li; Jiuling Huang; Kaihui Zhang; Tingting Bi
Author_Institution :
Sch. of Software, Harbin Univ. of Sci. &
Abstract :
This paper proposed a method based on the cluster boundary sampling RF-Bagging to solve the problem of imbalance data classification. It uses the cluster boundaries sampling of down-sampling preprocessing training data and then uses SVM and RF two different base classifiers as learning algorithms, integrated training and learning before and after sampling data by bagging respectively, two contrast experiment results are obtained. Finally, we use ROC curves and AUC values as results of the evaluation. The experimental results show that the method can improve the classification effect of classifier, and deal with the imbalance of data classification problems effectively.
Conference_Titel :
Software Intelligence Technologies and Applications & International Conference on Frontiers of Internet of Things 2014, International Conference on
Print_ISBN :
978-1-84919-970-4
DOI :
10.1049/cp.2014.1580