DocumentCode :
3001021
Title :
Understanding videos, constructing plots learning a visually grounded storyline model from annotated videos
Author :
Gupta, Arpan ; Srinivasan, P. ; Jianbo Shi ; Davis, Larry S.
Author_Institution :
Univ. of Maryland, College Park, MD, USA
fYear :
2009
fDate :
20-25 June 2009
Firstpage :
2012
Lastpage :
2019
Abstract :
Analyzing videos of human activities involves not only recognizing actions (typically based on their appearances), but also determining the story/plot of the video. The storyline of a video describes causal relationships between actions. Beyond recognition of individual actions, discovering causal relationships helps to better understand the semantic meaning of the activities. We present an approach to learn a visually grounded storyline model of videos directly from weakly labeled data. The storyline model is represented as an AND-OR graph, a structure that can compactly encode storyline variation across videos. The edges in the AND-OR graph correspond to causal relationships which are represented in terms of spatio-temporal constraints. We formulate an Integer Programming framework for action recognition and storyline extraction using the storyline model and visual groundings learned from training data.
Keywords :
graph theory; image representation; integer programming; learning (artificial intelligence); spatiotemporal phenomena; video coding; AND-OR graph; encoding; human action recognition; human activity analysis; integer programming framework; plots learning construction; semantic meaning; spatio-temporal constraint; video annotation; video understanding; visually grounded storyline model extraction; Data mining; Educational institutions; Grounding; Humans; Linear programming; Stochastic processes; Surveillance; Traffic control; Training data; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
Conference_Location :
Miami, FL
ISSN :
1063-6919
Print_ISBN :
978-1-4244-3992-8
Type :
conf
DOI :
10.1109/CVPR.2009.5206492
Filename :
5206492
Link To Document :
بازگشت