Title :
A new algorithm for imbalanced datasets in presence of outliers and noise
Author :
Zhao, Shu-Juan ; Zhang, Hua-Peng ; Li, Lei
Author_Institution :
Coll. of Sci., Nanjing Univ. of Posts & Telecommun., Nanjing, China
Abstract :
In recent years, learning from imbalanced datasets has attracted much attention both in academic and industrial fields. The kernel modification method based on Riemannian metric is an effective method to handle the class imbalance problem. But it cannot deal with the outliers and noise in the imbalanced datasets. However, Fuzzy Support Vector Machine (FSVM) can deal with the outliers and noise in the balanced datasets. In this paper, we combine the FSVM with the kernel modification method based on Riemannian metric together to handle the imbalanced datasets in presence of outliers and noise. Experimental results on four UCI datasets show this method to be effective in improving class prediction accuracy.
Keywords :
data handling; fuzzy set theory; learning (artificial intelligence); support vector machines; FSVM; Riemannian metric; academic fields; fuzzy support vector machine; imbalanced datasets; industrial fields; kernel modification method; learning algorithm; new algorithm; noise presence; outliers presence; Accuracy; Kernel; Learning systems; Machine learning; Measurement; Noise; Support vector machines; FSVM(fuzzy support vector machines); imbalanced datasets; outliers and noise;
Conference_Titel :
Natural Computation (ICNC), 2012 Eighth International Conference on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4577-2130-4
DOI :
10.1109/ICNC.2012.6234723