Title :
Action recognition based on spatial-temporal pyramid sparse coding
Author :
Xiaojing Zhang ; Hua Zhang ; Xiaochun Cao
Author_Institution :
Sch. of Comput. Sci. & Technol., Tianjin Univ., Tianjin, China
Abstract :
This paper introduces a novel video representation, termed spatial-temporal pyramid sparse coding (STPSC), which characterizes both the spatial and temporal aspects of a video. Specifically, the co-occurrences of visual words are computed with respect to the spatial layout and the temporal ordering of the features in the video, so the representation captures both the spatial arrangement and the temporal relationships of the words. The representation is motivated by spatial pyramid matching (SPM), a technique used to recognize scenes in images. We extend SPM to video analysis by combining it with sparse coding. First, dense feature points are extracted and described by displacement information from a dense optical flow field. Then, sparse coding is used to quantize the feature descriptors, and the spatial-temporal pyramid is introduced to represent an action. Finally, an SVM is used to classify the videos. Experimental results show improvements over state-of-the-art techniques on a public action dataset.
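The pooling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `st_pyramid_pool`, the use of max pooling over absolute code values, the choice of 2^l bins per axis at level l, and the final L2 normalisation are all assumptions made for illustration. The sparse codes are taken as given, and the (x, y, t) positions are assumed to be normalised to [0, 1].

```python
import math


def st_pyramid_pool(codes, coords, levels=(0, 1, 2)):
    """Max-pool sparse codes over a spatial-temporal pyramid (illustrative).

    codes  : list of N sparse codes, each a list of K floats.
    coords : list of N (x, y, t) positions, each normalised to [0, 1].
    levels : pyramid levels; level l splits x, y and t into 2**l bins each.
    Returns a list of length K * sum(8**l for l in levels), L2-normalised.
    """
    k = len(codes[0])
    feature = []
    for level in levels:
        bins = 2 ** level
        # One pooled K-dimensional vector per space-time cell at this level.
        cells = [[0.0] * k for _ in range(bins ** 3)]
        for code, (x, y, t) in zip(codes, coords):
            # Bin index per axis; a coordinate of exactly 1.0 goes to the last bin.
            ix, iy, it = (min(int(v * bins), bins - 1) for v in (x, y, t))
            cell = cells[(it * bins + iy) * bins + ix]
            # Assumed pooling rule: keep the largest absolute response per word.
            for j, value in enumerate(code):
                cell[j] = max(cell[j], abs(value))
        # Concatenate the cells of this level onto the video-level feature.
        for cell in cells:
            feature.extend(cell)
    norm = math.sqrt(sum(v * v for v in feature))
    return [v / norm for v in feature] if norm > 0 else feature


# Toy input: three local features coded over a two-word dictionary.
codes = [[1.0, 0.0], [0.0, 2.0], [-3.0, 0.0]]
coords = [(0.1, 0.1, 0.1), (0.9, 0.9, 0.9), (0.1, 0.1, 0.9)]
feature = st_pyramid_pool(codes, coords, levels=(0, 1))
```

The resulting vector concatenates one pooled code per space-time cell, so features that fall in different regions or at different times contribute to different parts of the representation. A vector of this kind could then be passed to a classifier such as an SVM, as the abstract describes.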
Keywords :
feature extraction; image classification; image matching; image sequences; support vector machines; video coding; SPM technology; STPSC; SVM; action recognition; dense feature point extraction; dense optical flow field; displacement information; feature descriptor; image recognition; spatial layout; spatial pyramid matching; spatial-temporal pyramid sparse coding; support vector machines; video analysis; video classification; video feature sequence; video presentation term; video representation; video spatial aspect; video temporal aspect; visual words cooccurrence; Conferences; Dictionaries; Encoding; Feature extraction; Pattern recognition; Trajectory; Visualization;
Conference_Title :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4