Title :
Imbalanced Data Classification Based on a Hybrid Resampling SVM Method
Author :
Lu Cao;Yikui Zhai
Author_Institution :
Sch. of Inf. Eng., Wuyi Univ., Jiangmen, China
Abstract :
Imbalanced datasets are frequently founded in many different applications, causing poor predication performances for minority class. In the paper, a hybrid re-sampling approach was proposed to deal with the two-class imbalanced data classification. Firstly, SMOTE technique is used to generate synthetic points for the minority class, then, under-sampling technique was used to delete some samples of the majority with less classified information. Thus, relative balanced training datasets are generated and we use SVM to cope with the new dataset. Experimental results on a synthetic dataset and five benchmark UCI datasets are provided to show the effectiveness of the proposed method.
Keywords :
"Support vector machines","Training","Classification algorithms","Kernel","Glass","Breast cancer","Benchmark testing"
Conference_Titel :
Ubiquitous Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and Communications and Its Associated Workshops (UIC-ATC-ScalCom), 2015 IEEE 12th Intl Conf on
DOI :
10.1109/UIC-ATC-ScalCom-CBDCom-IoP.2015.275