• DocumentCode
    3295170
  • Title

    Learning Semantic Motion Patterns for Dynamic Scenes by Improved Sparse Topical Coding

  • Author

    Fu, Wei ; Wang, Jinqiao ; Li, Zechao ; Lu, Hanqing ; Ma, Songde

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
  • fYear
    2012
  • fDate
    9-13 July 2012
  • Firstpage
    296
  • Lastpage
    301
  • Abstract
    With the proliferation of cameras in public areas, it becomes increasingly desirable to develop fully automated surveillance and monitoring systems. In this paper, we propose a novel unsupervised approach to automatically explore motion patterns occurring in dynamic scenes under an improved sparse topical coding (STC) framework. Given an input video with a fixed camera, we first segment the whole video into a sequence of clips (documents) without overlapping. Optical flow features are extracted from each pair of consecutive frames, and quantized into discrete visual words. Then the video is represented by a word-document hierarchical topic model through a generative process. Finally, an improved sparse topical coding approach is proposed for model learning. The semantic motion patterns (latent topics) are learned automatically and each video clip is represented as a weighted summation of these patterns with only a few nonzero coefficients. The proposed approach is purely data-driven and scene independent (not an object-class specific), which make it suitable for very large range of scenarios. Experiments demonstrate that our approach outperforms the state-of-the art technologies in dynamic scene analysis.
  • Keywords
    image motion analysis; image sequences; learning (artificial intelligence); video surveillance; cameras; discrete visual words; dynamic scenes; fully automated surveillance; improved sparse topical coding framework; latent topics; model learning; monitoring systems; nonzero coefficients; optical flow features; public areas; semantic motion patterns; unsupervised approach; video clip; weighted summation; word-document hierarchical topic model; Cameras; Computational modeling; Dictionaries; Dynamics; Encoding; Semantics; Visualization; motion patterns; scene model; sparse topical coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2012 IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4673-1659-0
  • Type

    conf

  • DOI
    10.1109/ICME.2012.133
  • Filename
    6298413