• DocumentCode
    3004594
  • Title

    Discriminative subvolume search for efficient action detection

  • Author

    Junsong Yuan ; Zicheng Liu ; Ying Wu

  • Author_Institution
    EECS Dept., Northwestern Univ., Evanston, IL, USA
  • fYear
    2009
  • fDate
    20-25 June 2009
  • Firstpage
    2442
  • Lastpage
    2449
  • Abstract
    Actions are spatio-temporal patterns which can be characterized by collections of spatio-temporal invariant features. Detection of actions is to find the re-occurrences (e.g. through pattern matching) of such spatio-temporal patterns. This paper addresses two critical issues in pattern matching-based action detection: (1) efficiency of pattern search in 3D videos and (2) tolerance of intra-pattern variations of actions. Our contributions are two-fold. First, we propose a discriminative pattern matching called naive-Bayes based mutual information maximization (NBMIM) for multi-class action categorization. It improves the state-of-the-art results on standard KTH dataset. Second, a novel search algorithm is proposed to locate the optimal subvolume in the 3D video space for efficient action detection. Our method is purely data-driven and does not rely on object detection, tracking or background subtraction. It can well handle the intra-pattern variations of actions such as scale and speed variations, and is insensitive to dynamic and clutter backgrounds and even partial occlusions. The experiments on versatile datasets including KTH and CMU action datasets demonstrate the effectiveness and efficiency of our method.
  • Keywords
    Bayes methods; feature extraction; gesture recognition; pattern matching; video signal processing; 3D video space; action detection; discriminative pattern matching; discriminative subvolume search; intrapattern variation; multiclass action categorization; naive-Bayes based mutual information maximization; pattern search; spatiotemporal invariant features; spatiotemporal pattern; Clothing; Computational efficiency; Humans; Image sampling; Mutual information; Object detection; Pattern matching; Training data; Video sequences; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-3992-8
  • Type

    conf

  • DOI
    10.1109/CVPR.2009.5206671
  • Filename
    5206671