DocumentCode :
69012
Title :
Geometric Optimum Experimental Design for Collaborative Image Retrieval
Author :
Lining Zhang ; Lipo Wang ; Weisi Lin ; Shuicheng Yan
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
24
Issue :
2
fYear :
2014
fDate :
Feb. 2014
Firstpage :
346
Lastpage :
359
Abstract :
Relevance feedback (RF) schemes have been widely designed to improve the performance of content-based image retrieval. Despite the success, it is not appropriate to require the user to label a large number of samples in RF. Collaborative image retrieval (CIR) aims to reduce the labeling efforts of the user by resorting to the auxiliary information. Support vector machine (SVM) active learning can select ambiguous samples as the most informative ones for the user to label with the help of the optimal hyperplane of SVM, and thus alleviate the labeling efforts of conventional RF. However, the optimal hyperplane of SVM is usually unstable and inaccurate with small-sized training data, and this is always the case in image retrieval since the user would not like to label a large number of feedback samples and cannot label each sample accurately all the time. In this paper, we propose a novel active learning method, i.e., geometric optimum experimental design (GOED), to select multiple representative samples in the database as the most informative ones for the user to label. Especially, GOED can alleviate the small-sized training data problem by leveraging the geometric structure of unlabeled samples in the reproducing kernel Hilbert space and thus further enhance the performance of image retrieval. Different from the conventional manifold regularization framework, the new method can effectively select the most informative samples for the user to label in image retrieval. By minimizing the expected average prediction variance on the test data, GOED has a clear geometric interpretation to select a set of the most representative samples in the database iteratively with the global optimum. Compared with the popular SVM active learning, our method is label-independent and can effectively avoid various potential problems caused by insufficient and inexactly labeled samples in RF, and is more appropriate and useful for image retrieval. Extensive experiments on both synthetic da- asets and a real-world image database have been conducted to show the advantages of the proposed GOED for CIR.
Keywords :
Hilbert spaces; content-based retrieval; design of experiments; geometry; image retrieval; learning (artificial intelligence); relevance feedback; support vector machines; CIR; GOED; RF; auxiliary information; collaborative image retrieval; content-based image retrieval; geometric optimum experimental design; kernel Hilbert space; optimal SVM hyperplane; real-world image database; relevance feedback schemes; small-sized training data problem; support vector machine active learning; synthetic datasets; unlabeled samples; Image retrieval; Kernel; Manifolds; Radio frequency; Support vector machines; Training data; Active learning; content-based image retrieval (CBIR); manifold regularization; optimum experimental design; relevance feedback (RF);
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2013.2276172
Filename :
6574277
Link To Document :
بازگشت