DocumentCode
3302346
Title
Integrating multiple information of active learning for image classification
Author
Haihui Xu ; Pengpeng Zhao ; Jian Wu ; Zhiming Cui ; Chengchao Li
Author_Institution
Sch. of Comput. Sci. & Technol., Soochow Univ. Suzhou, Suzhou, China
fYear
2013
fDate
13-15 Dec. 2013
Firstpage
374
Lastpage
379
Abstract
In the application of image classification, active learning algorithm can effectively alleviate the efforts of labeling by selecting the most informative instances for user annotation, as well as obtain a satisfactory classifier. Traditional active learning methods do not consider the cost of manual labeling, which is usually regarded as the same. They focus on minimizing the classification error, aiming at improving the classifier performance. However, in fact, the user annotation cost is not equal and changes dynamically. We introduce the value of the information framework to measure the instance informativeness, which including misclassification risk and the cost of user annotation. While the value of information is based on probability over the current classifier, only taking into the labeled examples account, thus it may query the outliers. In order to simultaneously lever the distribution information of a large amount of the remaining unlabeled instances, we use information density to measure the representativeness of the sample. To this end, we propose an integrating multiple information of active learning method for image classification (IMIM), which incorporates the strength of both value of information and information density measure criteria by a heuristic weighting strategy. At last, select the most informative instance by the expected error reduction method. Compared with the state of art method, experimental results on diverse datasets demonstrate the effectiveness of our proposed method.
Keywords
image classification; learning (artificial intelligence); active learning algorithm; distribution information; diverse datasets; error reduction method; heuristic weighting strategy; image classification; information density; instance informativeness; manual labeling; misclassification risk; multiple information integration; user annotation cost; Accuracy; Density measurement; Educational institutions; Labeling; Learning systems; Manuals; Training; Active learning; Information Density; Manual labeling cost; Value of information;
fLanguage
English
Publisher
ieee
Conference_Titel
Granular Computing (GrC), 2013 IEEE International Conference on
Conference_Location
Beijing
Type
conf
DOI
10.1109/GrC.2013.6740439
Filename
6740439
Link To Document