Title :
Efficient Near-Duplicate Keyframe Retrieval with Visual Language Models
Author :
Wu, Xiao ; Zhao, Wan-Lei ; Ngo, Chong Wah
Author_Institution :
City Univ. of Hong Kong, Hong Kong
Abstract :
Near-duplicate keyframe retrieval is a critical task for video similarity measure, video threading and tracking. In this paper, instead of using expensive point-to-point matching on keypoints, we investigate the visual language models built on visual keywords to speed up the near-duplicate keyframe retrieval. The main idea is to estimate a visual language model on visual keywords for each keyframe and compare keyframes by the likelihood of their visual language models. Experiments on a subset of TRECVID-2004 video corpus show that visual language models built on visual keywords demonstrate promising performance for near-duplicate keyframe retrieval, which greatly speed up the retrieval speed although sacrifice a little performance compared to expensive point-to-point matching.
Keywords :
image retrieval; video signal processing; visual languages; TRECVID-2004 video corpus; near-duplicate keyframe retrieval; point-to-point matching; video similarity; video threading; video tracking; visual language model; visual language models; Computer science; Councils; Detectors; Information retrieval; Natural language processing; Natural languages; Object detection; Photometry; Speech recognition; Vocabulary;
Conference_Titel :
Multimedia and Expo, 2007 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
1-4244-1016-9
Electronic_ISBN :
1-4244-1017-7
DOI :
10.1109/ICME.2007.4284696