DocumentCode
3295170
Title
Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding
Author
Fu, Wei ; Wang, Jinqiao ; Li, Zechao ; Lu, Hanqing ; Ma, Songde
Author_Institution
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
fYear
2012
fDate
9-13 July 2012
Firstpage
296
Lastpage
301
Abstract
With the proliferation of cameras in public areas, it becomes increasingly desirable to develop fully automated surveillance and monitoring systems. In this paper, we propose a novel unsupervised approach to automatically explore motion patterns occurring in dynamic scenes under an improved sparse topical coding (STC) framework. Given an input video with a fixed camera, we first segment the whole video into a sequence of clips (documents) without overlapping. Optical flow features are extracted from each pair of consecutive frames, and quantized into discrete visual words. Then the video is represented by a word-document hierarchical topic model through a generative process. Finally, an improved sparse topical coding approach is proposed for model learning. The semantic motion patterns (latent topics) are learned automatically and each video clip is represented as a weighted summation of these patterns with only a few nonzero coefficients. The proposed approach is purely data-driven and scene independent (not an object-class specific), which make it suitable for very large range of scenarios. Experiments demonstrate that our approach outperforms the state-of-the art technologies in dynamic scene analysis.
Keywords
image motion analysis; image sequences; learning (artificial intelligence); video surveillance; cameras; discrete visual words; dynamic scenes; fully automated surveillance; improved sparse topical coding framework; latent topics; model learning; monitoring systems; nonzero coefficients; optical flow features; public areas; semantic motion patterns; unsupervised approach; video clip; weighted summation; word-document hierarchical topic model; Cameras; Computational modeling; Dictionaries; Dynamics; Encoding; Semantics; Visualization; motion patterns; scene model; sparse topical coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2012 IEEE International Conference on
Conference_Location
Melbourne, VIC
ISSN
1945-7871
Print_ISBN
978-1-4673-1659-0
Type
conf
DOI
10.1109/ICME.2012.133
Filename
6298413
Link To Document