• DocumentCode
    594987
  • Title

    Action recognition based on spatial-temporal pyramid sparse coding

  • Author

    Xiaojing Zhang ; Hua Zhang ; Xiaochun Cao

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    1455
  • Lastpage
    1458
  • Abstract
    This paper introduces a novel video presentation term spatial-temporal pyramid sparse coding (STPSC) which characterizes both the spatial and temporal aspects of the video. Specifically, the co-occurrences of visual words are computed with respect to the spatial layout and the sequencing of the features in the video. The representation captures both the spatial arrangement and the temporal relationship of the words. Our representation is motivated by the technology spatial pyramid matching (SPM) which is used to recognize scenes in the image. We extend SPM to video analysis combining with sparse coding. Firstly, dense feature points are extracted and represented by displacement information from a dense optical flow field. Then sparse coding is used to quantize the feature descriptors, and the spatial-temporal pyramid is introduced to represent an action. Finally, we use SVM to classify the videos. Experimental results showed improvements over the state-of-the-art techniques on the public action dataset.
  • Keywords
    feature extraction; image classification; image matching; image sequences; support vector machines; video coding; SPM technology; STPSC; SVM; action recognition; dense feature point extraction; dense optical flow field; displacement information; feature descriptor; image recognition; spatial layout; spatial pyramid matching; spatial-temporal pyramid sparse coding; support vector machines; video analysis; video classification; video feature sequence; video presentation term; video representation; video spatial aspect; video temporal aspect; visual words cooccurrence; Conferences; Dictionaries; Encoding; Feature extraction; Pattern recognition; Trajectory; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460416