DocumentCode
254102
Title
Improving Semantic Concept Detection through the Dictionary of Visually-Distinct Elements
Author
Dehghan, Afshin ; Idrees, Haroon ; Shah, Mubarak
Author_Institution
Center for Res. in Comput. Vision, Univ. of Central Florida, Orlando, FL, USA
fYear
2014
fDate
23-28 June 2014
Firstpage
2585
Lastpage
2592
Abstract
A video captures a sequence and interactions of concepts that can be static, for instance, objects or scenes, or dynamic, such as actions. For large datasets containing hundreds of thousands of images or videos, it is impractical to manually annotate all the concepts, or all the instances of a single concept. However, a dictionary with visually-distinct elements can be created automatically from unlabeled videos which can capture and express the entire dataset. The downside to this machine-discovered dictionary is meaninglessness, i.e., its elements are devoid of semantics and interpretation. In this paper, we present an approach that leverages the strengths of semantic concepts and the machine-discovered DOVE by learning a relationship between them. Since instances of a semantic concept share visual similarity, the proposed approach uses soft-consensus regularization to learn the mapping that enforces instances from each semantic concept to have similar representations. The testing is performed by projecting the query onto the DOVE as well as new representations of semantic concepts from training, with non-negativity and unit summation constraints for probabilistic interpretation. We tested our formulation on TRECVID MED and SIN tasks, and obtained encouraging results.
Keywords
image representation; indexing; learning (artificial intelligence); probability; video signal processing; DOVE; TRECVID MED task; TRECVID SIN task; dictionary of visually-distinct elements; semantic concept detection; semantic indexing; soft consensus regularization; unit summation constraints; visual similarity; Computer vision; Dictionaries; Semantics; Silicon compounds; Testing; Training; Vectors; Attribute; Concept Detection; Consensus Regularization; Dictionary Learning; Event Detection; Sparse Representation; TRECVID MED; TRECVID SIN;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
Conference_Location
Columbus, OH
Type
conf
DOI
10.1109/CVPR.2014.331
Filename
6909727
Link To Document