DocumentCode
578125
Title
Active learning for imbalance problem using L-GEM of RBFNN
Author
Hu, Junjie
Author_Institution
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
Volume
2
fYear
2012
fDate
15-17 July 2012
Firstpage
490
Lastpage
495
Abstract
In lots of important applications, such as malignant cell detection, network intrusion detection, error signal detection in power system, the data distributions of positive and negative classes are usually imbalance. Many classifiers could not perform well in data imbalance cases. The major problem is that classifiers tend to ignore samples and accuracy of the minority class without regarding the higher cost of misclassification in this minor class. Therefore, pattern classification for imbalance data becomes a hot challenge to both academy and industry. In this paper, we propose an active learning method for imbalance data using a stochastic sensitivity measure (ST-SM) of Radial Basis Function Neural Network (RBFNN). A large ST-SM indicates the RBFNN is uncertain and yields a large output fluctuation around a particular sample. These samples yielding large ST-SM values are selected for adding to the training set in each turn. Empirically, samples with large output perturbation (i.e. large ST-SM) should be located near the classification boundary and is of great significance for the training of classifier. As for the imbalance characteristic of the data set, the ST-SM should be able to reduce the number of redundant samples being selected in the majority class, rebalance the sample distribution of the training set, and finally improve the performance of the classifier.
Keywords
learning (artificial intelligence); pattern classification; radial basis function networks; stochastic processes; L-GEM; RBFNN; ST-SM; active learning method; classifiers; imbalance data problem; localized generalization error model; majority class; minority class; negative class data distributions; pattern classification; positive class data distributions; radial basis function neural network; stochastic sensitivity measure; training set; Abstracts; Active learning; Imbalance data; Localized Generalization Error Model; Sample selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location
Xian
ISSN
2160-133X
Print_ISBN
978-1-4673-1484-8
Type
conf
DOI
10.1109/ICMLC.2012.6358972
Filename
6358972
Link To Document