• DocumentCode
    760422
  • Title

    Canonical Correlation Analysis of Video Volume Tensors for Action Categorization and Detection

  • Author

    Kim, Tae-Kyun ; Cipolla, Roberto

  • Author_Institution
    Sidney Sussex Coll., Univ. of Cambridge, Cambridge
  • Volume
    31
  • Issue
    8
  • fYear
    2009
  • Firstpage
    1415
  • Lastpage
    1428
  • Abstract
    This paper addresses a spatiotemporal pattern recognition problem. The main purpose of this study is to find a right representation and matching of action video volumes for categorization. A novel method is proposed to measure video-to-video volume similarity by extending Canonical Correlation Analysis (CCA), a principled tool to inspect linear relations between two sets of vectors, to that of two multiway data arrays (or tensors). The proposed method analyzes video volumes as inputs avoiding the difficult problem of explicit motion estimation required in traditional methods and provides a way of spatiotemporal pattern matching that is robust to intraclass variations of actions. The proposed matching is demonstrated for action classification by a simple Nearest Neighbor classifier. We, moreover, propose an automatic action detection method, which performs 3D window search over an input video with action exemplars. The search is speeded up by dynamic learning of subspaces in the proposed CCA. Experiments on a public action data set (KTH) and a self-recorded hand gesture data showed that the proposed method is significantly better than various state-of-the-art methods with respect to accuracy. Our method has low time complexity and does not require any major tuning parameters.
  • Keywords
    gesture recognition; motion estimation; object detection; pattern classification; pattern matching; tensors; video signal processing; action categorization; action detection; action video volumes; canonical correlation analysis; motion estimation; multiway data arrays; nearest neighbor classifier; public action data set; self-recorded hand gesture data; spatiotemporal pattern matching; spatiotemporal pattern recognition problem; video volume tensors; video-to-video volume similarity; Action categorization; Face and gesture recognition; Feature evaluation and selection; Motion; Statistical; Video analysis; action detection; canonical correlation analysis; gesture recognition; incremental subspace learning; spatiotemporal pattern classification.; tensor;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2008.167
  • Filename
    4547427