• DocumentCode
    2771434
  • Title

    Active Learning with Adaptive Heterogeneous Ensembles

  • Author

    Lu, Zhenyu ; Wu, Xindong ; Bongard, Josh

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Vermont, Burlington, VT, USA
  • fYear
    2009
  • fDate
    6-9 Dec. 2009
  • Firstpage
    327
  • Lastpage
    336
  • Abstract
    One common approach to active learning is to iteratively train a single classifier by choosing data points based on its uncertainty, but it is nontrivial to design uncertainty measures unbiased by the choice of classifier. Query by committee suggests that given an ensemble of diverse but accurate classifiers, the most informative data points are those that cause maximal disagreement among the predictions of the ensemble members. However the method for finding ensembles appropriate to a given data set remains an open question. In this paper, the random subspace method is combined with active learning to create multiple instances of different classifier types, and an algorithm is introduced that adapts the ratio of different classifier types in the ensemble towards better overall accuracy. Here we show that the proposed algorithm outperforms C4.5 with uncertainty sampling, Naive Bayes with uncertainty sampling, bagging, boosting and the random subspace method with random sampling. To the best of our knowledge, our work is the first to adapt the ratio of classifiers in a heterogeneous ensemble for active learning.
  • Keywords
    Bayes methods; learning (artificial intelligence); random processes; uncertainty handling; active learning; adaptive heterogeneous ensembles; data points; maximal disagreement; naive Bayes; random sampling; random subspace method; uncertainty measures; uncertainty sampling; Bagging; Boosting; Computer science; Costs; Data mining; Labeling; Learning systems; Measurement uncertainty; Sampling methods; Stochastic processes; Active Learning; Adaptive Heterogeneous Ensembles; Ensemble Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-5242-2
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2009.63
  • Filename
    5360258