Title :
An Improved SMOTE Imbalanced Data Classification Method Based on Support Degree
Author :
Kewen Li ; Wenrong Zhang ; Qinghua Lu ; Xianghua Fang
Author_Institution :
Coll. of Comput. & Commun. Eng., China Univ. of Pet., Qingdao, China
Abstract :
Imbalanced data-set Classification has become a hotspot problem in Data Mining. The essential assumption of the traditional classification algorithms is that the distribution of the classes is balanced, therefore the algorithms used in Imbalanced data-set Classification cannot achieve an ideal effect. In view of imbalance date-set classification, we propose an over sampling method based on support degree in order to guide people to select minority class samples and generate new minority class samples. In the light of support degree, it is now possible to identify minority class boundary samples, then produce a number of new samples between the boundary samples and their neighbors, finally add the synthetic samples to the original data-set to participate in training and testing. Experimental results show that the method has an obvious advantage in dealing with imbalanced data-set.
Keywords :
data mining; pattern classification; sampling methods; class distribution; data mining; imbalanced data-set classification; improved SMOTE imbalanced data classification method; minority class boundary sample identification; minority class sample generation; minority class sample selection; over sampling method; support degree; Algorithm design and analysis; Bagging; Classification algorithms; Computers; Data mining; Testing; Training; Boundary sample; Classification; Imbalanced data-sets; SMOTE; Support degree;
Conference_Titel :
Identification, Information and Knowledge in the Internet of Things (IIKI), 2014 International Conference on
DOI :
10.1109/IIKI.2014.14