• DocumentCode
    72688
  • Title

    Detecting Human Action as the Spatio-Temporal Tube of Maximum Mutual Information

  • Author

    Taiqing Wang ; Shengjin Wang ; Xiaoqing Ding

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    24
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    277
  • Lastpage
    290
  • Abstract
    Human action detection in complex scenes is a challenging problem due to its high-dimensional search space and dynamic backgrounds. To achieve efficient and accurate action detection, we represent a video sequence as a collection of feature trajectories and model human action as the spatio-temporal tube (ST-tube) of maximum mutual information. First, a random forest is built to evaluate the mutual information of feature trajectories toward the action class, and then a one-order Markov model is introduced to recursively infer the action regions at consecutive frames. By exploring the time-continuity property of feature trajectories, the action region is efficiently inferred at large temporal intervals. Finally, we obtain an ST-tube by concatenating the consecutive action regions bounding the human bodies. Compared with the popular spatio-temporal cuboid action model, the proposed ST-tube model is not only more efficient, but also more accurate in action localization. Experimental results on the KTH, CMU and UCF sports datasets validate the superiority of our approach over the state-of-the-art methods in both localization accuracy and time efficiency.
  • Keywords
    Markov processes; image sequences; video signal processing; CMU sports datasets; KTH sports datasets; ST-tube; UCF sports datasets; action class; complex scenes; dynamic backgrounds; feature trajectories; high-dimensional search space; human action detection; localization accuracy; maximum mutual information; one-order Markov model; random forest; spatiotemporal tube; temporal intervals; time efficiency; time-continuity property; video sequence; Electron tubes; Feature extraction; Markov processes; Mutual information; Trajectory; Vectors; Video sequences; Action detection; feature trajectory; mutual information; spatio-temporal cuboid (ST-cuboid); spatio-temporal tube (ST-tube);
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2013.2276856
  • Filename
    6575110