DocumentCode :
2210749
Title :
Active classifier training with the 3DS strategy
Author :
Reitmaier, Tobias ; Sick, Bernhard
Author_Institution :
Inst. for Software Syst. in Tech. Applic., Univ. of Passau, Passau, Germany
fYear :
2011
fDate :
11-15 April 2011
Firstpage :
88
Lastpage :
95
Abstract :
In this article, we introduce and investigate 3DS, a novel selection strategy for pool-based active training of a generative classifier, namely CMM (classifier based on a probabilistic mixture model). Such a generative classifier aims at modeling the processes underlying the “generation” of the data. The strategy 3DS considers the distance of samples to the decision boundary, the density in regions where samples are selected, and the diversity of samples in the query set that are chosen for labeling, e.g., by a human domain expert. The combination of the three measures in 3DS is adaptive in the sense that the weights of the distance and the density measure depend on the uniqueness of the classification. With nine benchmark data sets it is shown that 3DS outperforms a random selection strategy (baseline method), a pure closest sampling approach, ITDS (information theoretic diversity sampling), DWUS (density-weighted uncertainty sampling), DUAL (dual strategy for active learning), and PBAC (prototype based active learning) regarding evaluation criteria such as ranked performance based on classification accuracy, number of labeled samples (data utilization), and learning speed assessed by the area under the learning curve.
Keywords :
data mining; learning (artificial intelligence); pattern classification; sampling methods; 3DS strategy; CMM classifier; DUAL; DWUS; ITDS; PBAC; active classifier training; closest sampling approach; decision boundary; human domain expert; learning speed; pool based active training; random selection strategy; Accuracy; Coordinate measuring machines; Current measurement; Density measurement; Support vector machines; Training; Uncertainty; 3DS strategy; active learning; classification; generative modeling; selection strategy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining (CIDM), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9926-7
Type :
conf
DOI :
10.1109/CIDM.2011.5949421
Filename :
5949421
Link To Document :
بازگشت