DocumentCode :
3422527
Title :
Video Event Understanding Using Natural Language Descriptions
Author :
Ramanathan, Vignesh ; Liang, P. ; Li Fei-Fei
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
905
Lastpage :
912
Abstract :
Human action and role recognition play an important part in complex event understanding. State-of-the-art methods learn action and role models from detailed spatio temporal annotations, which requires extensive human effort. In this work, we propose a method to learn such models based on natural language descriptions of the training videos, which are easier to collect and scale with the number of actions and roles. There are two challenges with using this form of weak supervision: First, these descriptions only provide a high-level summary and often do not directly mention the actions and roles occurring in a video. Second, natural language descriptions do not provide spatio temporal annotations of actions and roles. To tackle these challenges, we introduce a topic-based semantic relatedness (SR) measure between a video description and an action and role label, and incorporate it into a posterior regularization objective. Our event recognition system based on these action and role models matches the state-of-the-art method on the TRECVID-MED11 event kit, despite weaker supervision.
Keywords :
gesture recognition; natural language processing; video signal processing; TRECVID-MED11 event kit; event recognition system; human action recognition; human role recognition; natural language descriptions; posterior regularization; semantic relatedness; spatio temporal annotations; training videos; video description; video event understanding; Atomic measurements; Encyclopedias; Instruments; Internet; Natural languages; Training; YouTube; natural language descriptions; video event recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.117
Filename :
6751222
Link To Document :
بازگشت