DocumentCode
3024445
Title
Active learning with optimal distribution for image classification
Author
Wu, Weining ; Guo, Maozu ; Liu, Yang ; Xu, Runzhang
Author_Institution
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
fYear
2011
fDate
26-28 July 2011
Firstpage
132
Lastpage
136
Abstract
In this paper, we focus on the issue of building up a training set for the task of image classification at minimal labeling costs. It is a topic that has attracted the considerable attention in the recent years. We propose a novel active learning algorithm with optimal distribution. In order to solve the problems of the noisy distribution and the sampling bias in the actively sampling process, the empirical risk on the selected examples is weighted by density ratio, and then the risk on the test examples is estimated using only unlabeled examples and the marginal label distribution. Finally, the optimal training distribution is derived by minimizing the expected error of the risk. Our approach has been demonstrated on the task of image classification on the difficult benchmark PASCAL VOC 2007 dataset.
Keywords
image classification; learning (artificial intelligence); active learning algorithm; active sampling process; benchmark PASCAL VOC 2007 dataset; density ratio; empirical risk; image classification; marginal label distribution; minimal labeling costs; noisy distribution; optimal training distribution; sampling bias; Classification algorithms; Databases; Estimation; Image classification; Labeling; Machine learning; Training; image classification; importance weighting; pool-based active learning; risk estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Technology (ICMT), 2011 International Conference on
Conference_Location
Hangzhou
Print_ISBN
978-1-61284-771-9
Type
conf
DOI
10.1109/ICMT.2011.6001789
Filename
6001789
Link To Document