• DocumentCode
    3302346
  • Title

    Integrating multiple information of active learning for image classification

  • Author

    Haihui Xu ; Pengpeng Zhao ; Jian Wu ; Zhiming Cui ; Chengchao Li

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Soochow Univ. Suzhou, Suzhou, China
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    374
  • Lastpage
    379
  • Abstract
    In the application of image classification, active learning algorithm can effectively alleviate the efforts of labeling by selecting the most informative instances for user annotation, as well as obtain a satisfactory classifier. Traditional active learning methods do not consider the cost of manual labeling, which is usually regarded as the same. They focus on minimizing the classification error, aiming at improving the classifier performance. However, in fact, the user annotation cost is not equal and changes dynamically. We introduce the value of the information framework to measure the instance informativeness, which including misclassification risk and the cost of user annotation. While the value of information is based on probability over the current classifier, only taking into the labeled examples account, thus it may query the outliers. In order to simultaneously lever the distribution information of a large amount of the remaining unlabeled instances, we use information density to measure the representativeness of the sample. To this end, we propose an integrating multiple information of active learning method for image classification (IMIM), which incorporates the strength of both value of information and information density measure criteria by a heuristic weighting strategy. At last, select the most informative instance by the expected error reduction method. Compared with the state of art method, experimental results on diverse datasets demonstrate the effectiveness of our proposed method.
  • Keywords
    image classification; learning (artificial intelligence); active learning algorithm; distribution information; diverse datasets; error reduction method; heuristic weighting strategy; image classification; information density; instance informativeness; manual labeling; misclassification risk; multiple information integration; user annotation cost; Accuracy; Density measurement; Educational institutions; Labeling; Learning systems; Manuals; Training; Active learning; Information Density; Manual labeling cost; Value of information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing (GrC), 2013 IEEE International Conference on
  • Conference_Location
    Beijing
  • Type

    conf

  • DOI
    10.1109/GrC.2013.6740439
  • Filename
    6740439