• DocumentCode
    3024445
  • Title

    Active learning with optimal distribution for image classification

  • Author

    Wu, Weining ; Guo, Maozu ; Liu, Yang ; Xu, Runzhang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    132
  • Lastpage
    136
  • Abstract
    In this paper, we focus on the issue of building up a training set for the task of image classification at minimal labeling costs. It is a topic that has attracted the considerable attention in the recent years. We propose a novel active learning algorithm with optimal distribution. In order to solve the problems of the noisy distribution and the sampling bias in the actively sampling process, the empirical risk on the selected examples is weighted by density ratio, and then the risk on the test examples is estimated using only unlabeled examples and the marginal label distribution. Finally, the optimal training distribution is derived by minimizing the expected error of the risk. Our approach has been demonstrated on the task of image classification on the difficult benchmark PASCAL VOC 2007 dataset.
  • Keywords
    image classification; learning (artificial intelligence); active learning algorithm; active sampling process; benchmark PASCAL VOC 2007 dataset; density ratio; empirical risk; image classification; marginal label distribution; minimal labeling costs; noisy distribution; optimal training distribution; sampling bias; Classification algorithms; Databases; Estimation; Image classification; Labeling; Machine learning; Training; image classification; importance weighting; pool-based active learning; risk estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Technology (ICMT), 2011 International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-61284-771-9
  • Type

    conf

  • DOI
    10.1109/ICMT.2011.6001789
  • Filename
    6001789