DocumentCode :
2953852
Title :
Combination approach of SMOTE and biased-SVM for imbalanced datasets
Author :
He-Yong Wang
Author_Institution :
Coll. of E-Bus., South China Univ. of Technol., Guangzhou
fYear :
2008
fDate :
1-8 June 2008
Firstpage :
228
Lastpage :
231
Abstract :
A new approach to construct the classifiers from imbalanced datasets is proposed by combining SMOTE (synthetic minority over-sampling technique) and Biased-SVM (biased support vector machine) approaches. A dataset is imbalanced if the classification categories are not approximately equally represented. Often real-world data sets are predominately composed of ldquonormalrdquo examples with only a small percentage of ldquoabnormalrdquo or ldquointerestingrdquo examples. The cost of misclassifying an abnormal (interesting) example into a normal example is often much higher than that of the reverse error. It was known as a means of increasing the sensitivity of a classifier to the minority class using SMOTE over-sampling in minority class. But in this paper, it gives a good means of increasing the sensitivity of a classifier to the minority class by using SMOTE approaches within support vectors. As for support vector over-sampling, this paper proposes two different over-sampling algorithms to deal with the support vectors being over-sampled by its neighbors from the k nearest neighbors, not only within the support vectors but also within the entire minority class. Some experimental results confirms that the proposed combination approach of SMOTE and biased-SVM can achieve better classifier performance.
Keywords :
data analysis; pattern classification; support vector machines; SMOTE; biased support vector machine; biased-SVM; classification categories; imbalanced datasets; k nearest neighbors; synthetic minority over-sampling technique; Neural networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2008.4633794
Filename :
4633794
Link To Document :
بازگشت