Title :
Active Learning with Adaptive Heterogeneous Ensembles
Author :
Lu, Zhenyu ; Wu, Xindong ; Bongard, Josh
Author_Institution :
Dept. of Comput. Sci., Univ. of Vermont, Burlington, VT, USA
Abstract :
One common approach to active learning is to iteratively train a single classifier by choosing data points based on its uncertainty, but it is nontrivial to design uncertainty measures unbiased by the choice of classifier. Query by committee suggests that given an ensemble of diverse but accurate classifiers, the most informative data points are those that cause maximal disagreement among the predictions of the ensemble members. However the method for finding ensembles appropriate to a given data set remains an open question. In this paper, the random subspace method is combined with active learning to create multiple instances of different classifier types, and an algorithm is introduced that adapts the ratio of different classifier types in the ensemble towards better overall accuracy. Here we show that the proposed algorithm outperforms C4.5 with uncertainty sampling, Naive Bayes with uncertainty sampling, bagging, boosting and the random subspace method with random sampling. To the best of our knowledge, our work is the first to adapt the ratio of classifiers in a heterogeneous ensemble for active learning.
Keywords :
Bayes methods; learning (artificial intelligence); random processes; uncertainty handling; active learning; adaptive heterogeneous ensembles; data points; maximal disagreement; naive Bayes; random sampling; random subspace method; uncertainty measures; uncertainty sampling; Bagging; Boosting; Computer science; Costs; Data mining; Labeling; Learning systems; Measurement uncertainty; Sampling methods; Stochastic processes; Active Learning; Adaptive Heterogeneous Ensembles; Ensemble Learning;
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2009.63