• DocumentCode
    11846
  • Title

    Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling

  • Author

    Yu-Gang Jiang ; Qi Dai ; Wei Liu ; Xiangyang Xue ; Chong-Wah Ngo

  • Author_Institution
    Sch. of Comput. Sci., Fudan Univ., Shanghai, China
  • Volume
    24
  • Issue
    11
  • fYear
    2015
  • fDate
    Nov. 2015
  • Firstpage
    3781
  • Lastpage
    3795
  • Abstract
    Human action recognition in unconstrained videos is a challenging problem with many applications. Most state-of-the-art approaches adopted the well-known bag-of-features representations, generated based on isolated local patches or patch trajectories, where motion patterns, such as object-object and object-background relationships are mostly discarded. In this paper, we propose a simple representation aiming at modeling these motion relationships. We adopt global and local reference points to explicitly characterize motion information, so that the final representation is more robust to camera movements, which widely exist in unconstrained videos. Our approach operates on the top of visual codewords generated on dense local patch trajectories, and therefore, does not require foreground-background separation, which is normally a critical and difficult step in modeling object relationships. Through an extensive set of experimental evaluations, we show that the proposed representation produces a very competitive performance on several challenging benchmark data sets. Further combining it with the standard bag-of-features or Fisher vector representations can lead to substantial improvements.
  • Keywords
    cameras; feature extraction; image motion analysis; image recognition; image representation; video signal processing; Fisher vector representation; bag-of-feature representation; camera movement; dense local patch trajectory; explicit motion model; human action recognition; isolated local patch trajectory; unconstrained video; visual codeword generation; Benchmark testing; Cameras; Shape; Tracking; Trajectory; Videos; Visualization; Human action recognition; camera motion; motion representation; reference points; trajectory;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2015.2456412
  • Filename
    7156132