DocumentCode :
578125
Title :
Active learning for imbalance problem using L-GEM of RBFNN
Author :
Hu, Junjie
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
Volume :
2
fYear :
2012
fDate :
15-17 July 2012
Firstpage :
490
Lastpage :
495
Abstract :
In lots of important applications, such as malignant cell detection, network intrusion detection, error signal detection in power system, the data distributions of positive and negative classes are usually imbalance. Many classifiers could not perform well in data imbalance cases. The major problem is that classifiers tend to ignore samples and accuracy of the minority class without regarding the higher cost of misclassification in this minor class. Therefore, pattern classification for imbalance data becomes a hot challenge to both academy and industry. In this paper, we propose an active learning method for imbalance data using a stochastic sensitivity measure (ST-SM) of Radial Basis Function Neural Network (RBFNN). A large ST-SM indicates the RBFNN is uncertain and yields a large output fluctuation around a particular sample. These samples yielding large ST-SM values are selected for adding to the training set in each turn. Empirically, samples with large output perturbation (i.e. large ST-SM) should be located near the classification boundary and is of great significance for the training of classifier. As for the imbalance characteristic of the data set, the ST-SM should be able to reduce the number of redundant samples being selected in the majority class, rebalance the sample distribution of the training set, and finally improve the performance of the classifier.
Keywords :
learning (artificial intelligence); pattern classification; radial basis function networks; stochastic processes; L-GEM; RBFNN; ST-SM; active learning method; classifiers; imbalance data problem; localized generalization error model; majority class; minority class; negative class data distributions; pattern classification; positive class data distributions; radial basis function neural network; stochastic sensitivity measure; training set; Abstracts; Active learning; Imbalance data; Localized Generalization Error Model; Sample selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
ISSN :
2160-133X
Print_ISBN :
978-1-4673-1484-8
Type :
conf
DOI :
10.1109/ICMLC.2012.6358972
Filename :
6358972
Link To Document :
بازگشت