Title :
Multimodal semi-supervised image classification by combining tag refinement, graph-based learning and support vector regression
Author :
Wenxuan Xie ; Zhiwu Lu ; Yuxin Peng ; Jianguo Xiao
Author_Institution :
Inst. of Comput. Sci. & Technol., Peking Univ., Beijing, China
Abstract :
We investigate an image classification task where the training images come along with tags, but only a subset being labeled, and the goal is to predict the class label of test images without tags. This task is crucial for image search engine on photo sharing Web sites. In previous work, it is handled by first learning a multiple kernel learning classifier using both image content and tags to score unlabeled training images, and then building up a least-squares regression (LSR) model on visual features to predict the label of test images. However, there exist three important issues in the task: (1) Image tags on photo sharing Web sites tend to be inaccurate and incomplete, and thus refining them is beneficial; (2) Supervised learning with a limited number of labeled samples may be unreliable to some extent, while a graph-based semi-supervised approach can be adopted by also considering similarities of unlabeled data; (3) LSR is established upon centered visual kernel columns and breaks the symmetry of kernel matrix, whereas support vector regression can readily use the original visual kernel and thus leverage its full power. To handle the task more effectively, we propose to combine tag refinement, graph-based learning and support vector regression together. Experimental results on the PASCAL VOC´07 and MIR Flickr datasets show the superior performance of the proposed approach.
Keywords :
Web sites; graph theory; image classification; learning (artificial intelligence); matrix algebra; regression analysis; search engines; support vector machines; LSR; MIR Flickr dataset; PASCAL VOC dataset; centered visual kernel columns; class label prediction; graph- based learning; graph-based learning; graph-based semisupervised approach; image search engine; image subset labelling; image tags; kernel matrix symmetry; labeled samples; least-squares regression model; multimodal semisupervised image classification; photo sharing Web sites; supervised learning; support vector regression; tag refinement; test images; training images; unlabeled data similarities; Graph-based semi-supervised learning; Support vector regression; Tag refinement;
Conference_Titel :
Image Processing (ICIP), 2013 20th IEEE International Conference on
Conference_Location :
Melbourne, VIC
DOI :
10.1109/ICIP.2013.6738887