Title :
Recognizing actions via sparse coding on structure projection
Author :
Lei Zhang ; Tao Wang ; Xiantong Zhen
Author_Institution :
Coll. of Inf. & Commun. Eng., Harbin Eng. Univ., Harbin, China
Abstract :
In this paper, we propose a novel method for human action recognition based on sparse coding with a pyramid matching. Spatio-temporal interest points (STIPs) are firstly detected by a newly developed detector named spatio-temporal steerable detector (STSD). To effectively capture the distribution of STIPs in the video sequence, we propose to project the STIPs onto the three orthogonal planes (TOP), and we employ a sparse coding algorithm combined with the spatial pyramid matching to encode the layout of STIPs. Therefore the structure of an action are sufficiently encoded, obtaining a informative holistic descriptor for action representation. Extensive experiments have been conducted on KTH and HMDB51 datasets. Our method achieves the state-of-the-art performance for action recognition showing the effectiveness of the proposed methods for human action representation.
Keywords :
gesture recognition; image representation; image sequences; video coding; HMDB51 datasets; KTH datasets; STIP; STSD; human action recognition; human action representation; informative holistic descriptor; sparse coding algorithm; spatial pyramid matching; spatio-temporal interest points; spatio-temporal steerable detector; structure projection; three orthogonal planes; video sequence; Spatio-temporal steerable detector; action recognition; projections of interest points; structure information;
Conference_Titel :
Image Processing (ICIP), 2013 20th IEEE International Conference on
Conference_Location :
Melbourne, VIC
DOI :
10.1109/ICIP.2013.6738497