مرکز منطقه ای اطلاع رساني علوم و فناوري - The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection

DocumentCode :

3427471

Title :

The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection

Author :

Zanfir, Mihai ; Leordeanu, Marius ; Sminchisescu, Cristian

Author_Institution :

Inst. of Math., Bucharest, Romania

fYear :

2013

fDate :

1-8 Dec. 2013

Firstpage :

2752

Lastpage :

2759

Abstract :

Human action recognition under low observational latency is receiving a growing interest in computer vision due to rapidly developing technologies in human-robot interaction, computer gaming and surveillance. In this paper we propose a fast, simple, yet powerful non-parametric Moving Pose (MP) framework for low-latency human action and activity recognition. Central to our methodology is a moving pose descriptor that considers both pose information as well as differential quantities (speed and acceleration) of the human body joints within a short time window around the current frame. The proposed descriptor is used in conjunction with a modified kNN classifier that considers both the temporal location of a particular frame within the action sequence as well as the discrimination power of its moving pose descriptor compared to other frames in the training set. The resulting method is non-parametric and enables low-latency recognition, one-shot learning, and action detection in difficult unsegmented sequences. Moreover, the framework is real-time, scalable, and outperforms more sophisticated approaches on challenging benchmarks like MSR-Action3D or MSR-DailyActivities3D.

Keywords :

image classification; image motion analysis; image sequences; learning (artificial intelligence); nonparametric statistics; object detection; object recognition; pose estimation; 3D kinematics descriptor; action sequence; computer vision; human action recognition; human body joints; human-robot interaction; low-latency action detection; low-latency action recognition; low-latency human action recognition; low-latency human activity recognition; modified kNN classifier; moving pose descriptor; nonparametric MP framework; nonparametric Moving Pose framework; one-shot learning; pose information; short time window; surveillance; temporal location; unsegmented sequences; Acceleration; Accuracy; Joints; Kinematics; Three-dimensional displays; Training; RGB-D cameras; action detection; action recognition; moving pose descriptor;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision (ICCV), 2013 IEEE International Conference on

Conference_Location :

Sydney, NSW

ISSN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2013.342

Filename :

6751453

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3427471