مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding

DocumentCode :

3295170

Title :

Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding

Author :

Fu, Wei ; Wang, Jinqiao ; Li, Zechao ; Lu, Hanqing ; Ma, Songde

Author_Institution :

Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China

fYear :

2012

fDate :

9-13 July 2012

Firstpage :

296

Lastpage :

301

Abstract :

With the proliferation of cameras in public areas, it becomes increasingly desirable to develop fully automated surveillance and monitoring systems. In this paper, we propose a novel unsupervised approach to automatically explore motion patterns occurring in dynamic scenes under an improved sparse topical coding (STC) framework. Given an input video with a fixed camera, we first segment the whole video into a sequence of clips (documents) without overlapping. Optical flow features are extracted from each pair of consecutive frames, and quantized into discrete visual words. Then the video is represented by a word-document hierarchical topic model through a generative process. Finally, an improved sparse topical coding approach is proposed for model learning. The semantic motion patterns (latent topics) are learned automatically and each video clip is represented as a weighted summation of these patterns with only a few nonzero coefficients. The proposed approach is purely data-driven and scene independent (not an object-class specific), which make it suitable for very large range of scenarios. Experiments demonstrate that our approach outperforms the state-of-the art technologies in dynamic scene analysis.

Keywords :

image motion analysis; image sequences; learning (artificial intelligence); video surveillance; cameras; discrete visual words; dynamic scenes; fully automated surveillance; improved sparse topical coding framework; latent topics; model learning; monitoring systems; nonzero coefficients; optical flow features; public areas; semantic motion patterns; unsupervised approach; video clip; weighted summation; word-document hierarchical topic model; Cameras; Computational modeling; Dictionaries; Dynamics; Encoding; Semantics; Visualization; motion patterns; scene model; sparse topical coding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo (ICME), 2012 IEEE International Conference on

Conference_Location :

Melbourne, VIC

ISSN :

1945-7871

Print_ISBN :

978-1-4673-1659-0

Type :

conf

DOI :

10.1109/ICME.2012.133

Filename :

6298413

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3295170