مرکز منطقه ای اطلاع رساني علوم و فناوري - An Approach to Pose-Based Action Recognition

DocumentCode :

639391

Title :

An Approach to Pose-Based Action Recognition

Author :

Chunyu Wang ; Yizhou Wang ; Yuille, Alan L.

Author_Institution :

Nat´l Eng. Lab. for Video Technol., Peking Univ., Beijing, China

fYear :

2013

fDate :

23-28 June 2013

Firstpage :

915

Lastpage :

922

Abstract :

We address action recognition in videos by modeling the spatial-temporal structures of human poses. We start by improving a state of the art method for estimating human joint locations from videos. More precisely, we obtain the K-best estimations output by the existing method and incorporate additional segmentation cues and temporal constraints to select the ``best´´ one. Then we group the estimated joints into five body parts (e.g. the left arm) and apply data mining techniques to obtain a representation for the spatial-temporal structures of human actions. This representation captures the spatial configurations of body parts in one frame (by spatial-part-sets) as well as the body part movements(by temporal-part-sets) which are characteristic of human actions. It is interpretable, compact, and also robust to errors on joint estimations. Experimental results first show that our approach is able to localize body joints more accurately than existing methods. Next we show that it outperforms state of the art action recognizers on the UCF sport, the Keck Gesture and the MSR-Action3D datasets.

Keywords :

data mining; gesture recognition; image motion analysis; image representation; image segmentation; pose estimation; video signal processing; K-best estimations; Keck gesture; MSR-action3D datasets; UCF sport; body part movements; body parts spatial configurations; data mining; human actions; human joint locations estimation; human poses; joint estimations; pose-based action recognition; segmentation cues; spatial-part-sets; spatial-temporal structures modeling; spatial-temporal structures representation; temporal constraints; temporal-part-sets; videos; Data mining; Dictionaries; Estimation; Histograms; Image color analysis; Itemsets; Joints; action recognition; feature learning; pose estimation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location :

Portland, OR

ISSN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2013.123

Filename :

6618967

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=639391