• DocumentCode
    578125
  • Title

    Active learning for imbalance problem using L-GEM of RBFNN

  • Author

    Hu, Junjie

  • Author_Institution
    Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
  • Volume
    2
  • fYear
    2012
  • fDate
    15-17 July 2012
  • Firstpage
    490
  • Lastpage
    495
  • Abstract
    In lots of important applications, such as malignant cell detection, network intrusion detection, error signal detection in power system, the data distributions of positive and negative classes are usually imbalance. Many classifiers could not perform well in data imbalance cases. The major problem is that classifiers tend to ignore samples and accuracy of the minority class without regarding the higher cost of misclassification in this minor class. Therefore, pattern classification for imbalance data becomes a hot challenge to both academy and industry. In this paper, we propose an active learning method for imbalance data using a stochastic sensitivity measure (ST-SM) of Radial Basis Function Neural Network (RBFNN). A large ST-SM indicates the RBFNN is uncertain and yields a large output fluctuation around a particular sample. These samples yielding large ST-SM values are selected for adding to the training set in each turn. Empirically, samples with large output perturbation (i.e. large ST-SM) should be located near the classification boundary and is of great significance for the training of classifier. As for the imbalance characteristic of the data set, the ST-SM should be able to reduce the number of redundant samples being selected in the majority class, rebalance the sample distribution of the training set, and finally improve the performance of the classifier.
  • Keywords
    learning (artificial intelligence); pattern classification; radial basis function networks; stochastic processes; L-GEM; RBFNN; ST-SM; active learning method; classifiers; imbalance data problem; localized generalization error model; majority class; minority class; negative class data distributions; pattern classification; positive class data distributions; radial basis function neural network; stochastic sensitivity measure; training set; Abstracts; Active learning; Imbalance data; Localized Generalization Error Model; Sample selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
  • Conference_Location
    Xian
  • ISSN
    2160-133X
  • Print_ISBN
    978-1-4673-1484-8
  • Type

    conf

  • DOI
    10.1109/ICMLC.2012.6358972
  • Filename
    6358972