DocumentCode
639391
Title
An Approach to Pose-Based Action Recognition
Author
Chunyu Wang ; Yizhou Wang ; Yuille, Alan L.
Author_Institution
Nat´l Eng. Lab. for Video Technol., Peking Univ., Beijing, China
fYear
2013
fDate
23-28 June 2013
Firstpage
915
Lastpage
922
Abstract
We address action recognition in videos by modeling the spatial-temporal structures of human poses. We start by improving a state of the art method for estimating human joint locations from videos. More precisely, we obtain the K-best estimations output by the existing method and incorporate additional segmentation cues and temporal constraints to select the ``best´´ one. Then we group the estimated joints into five body parts (e.g. the left arm) and apply data mining techniques to obtain a representation for the spatial-temporal structures of human actions. This representation captures the spatial configurations of body parts in one frame (by spatial-part-sets) as well as the body part movements(by temporal-part-sets) which are characteristic of human actions. It is interpretable, compact, and also robust to errors on joint estimations. Experimental results first show that our approach is able to localize body joints more accurately than existing methods. Next we show that it outperforms state of the art action recognizers on the UCF sport, the Keck Gesture and the MSR-Action3D datasets.
Keywords
data mining; gesture recognition; image motion analysis; image representation; image segmentation; pose estimation; video signal processing; K-best estimations; Keck gesture; MSR-action3D datasets; UCF sport; body part movements; body parts spatial configurations; data mining; human actions; human joint locations estimation; human poses; joint estimations; pose-based action recognition; segmentation cues; spatial-part-sets; spatial-temporal structures modeling; spatial-temporal structures representation; temporal constraints; temporal-part-sets; videos; Data mining; Dictionaries; Estimation; Histograms; Image color analysis; Itemsets; Joints; action recognition; feature learning; pose estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location
Portland, OR
ISSN
1063-6919
Type
conf
DOI
10.1109/CVPR.2013.123
Filename
6618967
Link To Document