Title :
Active learning with optimal distribution for image classification
Author :
Wu, Weining ; Guo, Maozu ; Liu, Yang ; Xu, Runzhang
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
In this paper, we focus on the issue of building up a training set for the task of image classification at minimal labeling costs. It is a topic that has attracted the considerable attention in the recent years. We propose a novel active learning algorithm with optimal distribution. In order to solve the problems of the noisy distribution and the sampling bias in the actively sampling process, the empirical risk on the selected examples is weighted by density ratio, and then the risk on the test examples is estimated using only unlabeled examples and the marginal label distribution. Finally, the optimal training distribution is derived by minimizing the expected error of the risk. Our approach has been demonstrated on the task of image classification on the difficult benchmark PASCAL VOC 2007 dataset.
Keywords :
image classification; learning (artificial intelligence); active learning algorithm; active sampling process; benchmark PASCAL VOC 2007 dataset; density ratio; empirical risk; image classification; marginal label distribution; minimal labeling costs; noisy distribution; optimal training distribution; sampling bias; Classification algorithms; Databases; Estimation; Image classification; Labeling; Machine learning; Training; image classification; importance weighting; pool-based active learning; risk estimation;
Conference_Titel :
Multimedia Technology (ICMT), 2011 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-61284-771-9
DOI :
10.1109/ICMT.2011.6001789