Title :
A Re-sampling Method for Class Imbalance Learning with Credit Data
Author :
Zhang, Li ; Wang, WenXian
Author_Institution :
Sch. of Inf. Technol. & Manage. Eng., Univ. of Int. Bus. & Econ., Beijing, China
Abstract :
Credit rating is a typical class imbalance problem. Over sampling methods are commonly used for dealing with this problem. This paper presents an improved over sampling approach based on synthetic minority over-sampling technique(SMOTE). First, use sample distribution of the minority class to estimate whether different types of samples are crossed. Then generate synthetic samples by samples in different class when different classes cross seriously. In addition, increase weights of part positive samples, which may be not on borderline. At last, the proposed method is evaluated on a credit data set. The results indicate that it is more effective than other methods for the class imbalance learning.
Keywords :
data handling; sampling methods; class imbalance learning; credit data set; credit rating; imbalance problem; resampling method; sample distribution; synthetic minority over-sampling technique; synthetic samples; Classification algorithms; Logistics; Measurement; Noise; Support vector machines; Testing; Training; SMOTE; class imbalance; credit rating; sample distribution;
Conference_Titel :
Information Technology, Computer Engineering and Management Sciences (ICM), 2011 International Conference on
Conference_Location :
Nanjing, Jiangsu
Print_ISBN :
978-1-4577-1419-1
DOI :
10.1109/ICM.2011.34