Title :
Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss
Author :
Yang Yang ; Zheng-Jun Zha ; Yue Gao ; Xiaofeng Zhu ; Tat-Seng Chua
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
Semantic video indexing, also known as video annotation or video concept detection in literatures, has been attracting significant attention in recent years. Due to deficiency of labeled training videos, most of the existing approaches can hardly achieve satisfactory performance. In this paper, we propose a novel semantic video indexing approach, which exploits the abundant user-tagged Web images to help learn robust semantic video indexing classifiers. The following two major challenges are well studied: 1) noisy Web images with imprecise and/or incomplete tags; and 2) domain difference between images and videos. Specifically, we first apply a non-parametric approach to estimate the probabilities of images being correctly tagged as confidence scores. We then develop a robust transfer video indexing (RTVI) model to learn reliable classifiers from a limited number of training videos together with the abundance of user-tagged images. The RTVI model is equipped with a novel sample-specific robust loss function, which employs the confidence score of a Web image as prior knowledge to suppress the influence and control the contribution of this image in the learning process. Meanwhile, the RTVI model discovers an optimal kernel space, in which the mismatch between images and videos is minimized for tackling the domain difference problem. Besides, we devise an iterative algorithm to effectively optimize the proposed RTVI model and a theoretical analysis on the convergence of the proposed algorithm is provided as well. Extensive experiments on various real-world multimedia collections demonstrate the effectiveness of the proposed robust semantic video indexing approach.
Keywords :
Internet; image classification; indexing; iterative methods; learning (artificial intelligence); statistical analysis; video signal processing; RTVI model; classifier learning; confidence score; domain difference; iterative algorithm; learning process; nonparametric approach; probability estimation; robust transfer video indexing; sample-specific robust loss function; semantic video indexing; user-tagged Web images; video annotation; video concept detection; Indexing; Kernel; Noise; Noise measurement; Robustness; Semantics; Video signal processing; Robust; semantic video indexing; transfer learning;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2014.2323014