Title :
Classification for imbalanced dataset based on biased empirical feature mapping
Author :
Zhiming Yang ; Yu Yang ; Wang Gang
Author_Institution :
Dept. of Autom. Test & Control, Harbin Inst. of Technol., Harbin, China
Abstract :
It is shown that an imbalanced datasets can pose serious problems to many real-world classification tasks when support vector machines is used as the learning machine. To solve this problem, we propose a modified method based on biased empirical feature mapping. In the new method, biased discriminant analysis was applied to make all majority samples far away from center of minority samples in empirical feature space, so that generalization ability of the classifier for minority samples can be improved. Through theoretical analysis and empirical study on synthetic datasets and UCI datasets, we show that our method augments the classification accuracy rate effectively.
Keywords :
learning (artificial intelligence); pattern classification; support vector machines; UCI datasets; biased discriminant analysis; biased empirical feature mapping; classification accuracy rate; imbalanced datasets; machine learning; real-world classification tasks; support vector machines; synthetic datasets; Accuracy; Kernel; Optimization methods; Support vector machines; Training; Vectors; Imbalanced data; biased discriminant analysis; empirical feature mapping; support vector machines;
Conference_Titel :
Instrumentation and Measurement Technology Conference (I2MTC), 2012 IEEE International
Conference_Location :
Graz
Print_ISBN :
978-1-4577-1773-4
DOI :
10.1109/I2MTC.2012.6229164