Title :
The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection
Author :
Zanfir, Mihai ; Leordeanu, Marius ; Sminchisescu, Cristian
Author_Institution :
Inst. of Math., Bucharest, Romania
Abstract :
Human action recognition under low observational latency is receiving growing interest in computer vision due to rapidly developing technologies in human-robot interaction, computer gaming, and surveillance. In this paper we propose a fast, simple, yet powerful non-parametric Moving Pose (MP) framework for low-latency human action and activity recognition. Central to our methodology is a moving pose descriptor that considers both pose information and differential quantities (speed and acceleration) of the human body joints within a short time window around the current frame. The proposed descriptor is used in conjunction with a modified kNN classifier that considers both the temporal location of a particular frame within the action sequence and the discrimination power of its moving pose descriptor compared to other frames in the training set. The resulting method is non-parametric and enables low-latency recognition, one-shot learning, and action detection in difficult unsegmented sequences. Moreover, the framework is real-time, scalable, and outperforms more sophisticated approaches on challenging benchmarks such as MSR-Action3D and MSR-DailyActivities3D.
Keywords :
image classification; image motion analysis; image sequences; learning (artificial intelligence); nonparametric statistics; object detection; object recognition; pose estimation; 3D kinematics descriptor; action sequence; computer vision; human action recognition; human body joints; human-robot interaction; low-latency action detection; low-latency action recognition; low-latency human action recognition; low-latency human activity recognition; modified kNN classifier; moving pose descriptor; nonparametric MP framework; nonparametric Moving Pose framework; one-shot learning; pose information; short time window; surveillance; temporal location; unsegmented sequences; Acceleration; Accuracy; Joints; Kinematics; Three-dimensional displays; Training; RGB-D cameras; action detection; action recognition
Conference_Title :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICCV.2013.342
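
The descriptor and frame-level classification summarized in the abstract can be illustrated with a minimal sketch. Several choices here are assumptions of this sketch, not details stated in this record: speed and acceleration are approximated by central finite differences over a five-frame window, the weights `alpha` and `beta` are hypothetical parameters, and the classifier is a plain kNN vote that omits the temporal-location and discrimination-power terms of the modified kNN the abstract mentions. The function names are illustrative only.

```python
import numpy as np

def moving_pose_descriptors(joints, alpha=1.0, beta=1.0):
    """Build one descriptor per frame from 3D joint positions.

    joints: array of shape (T, J, 3), T frames and J joints.
    alpha, beta: hypothetical weights for the speed and acceleration terms.
    Returns an array of shape (T - 4, 9 * J); the first and last two frames
    are dropped because the five-frame window does not fit there.
    """
    joints = np.asarray(joints, dtype=float)
    if joints.ndim != 3 or joints.shape[2] != 3 or joints.shape[0] < 5:
        raise ValueError("expected shape (T, J, 3) with T >= 5")
    flat = joints.reshape(joints.shape[0], -1)       # (T, 3J)
    pose = flat[2:-2]                                # P(t)
    speed = flat[3:-1] - flat[1:-3]                  # P(t+1) - P(t-1)
    accel = flat[4:] + flat[:-4] - 2.0 * flat[2:-2]  # P(t+2) + P(t-2) - 2P(t)
    return np.hstack([pose, alpha * speed, beta * accel])

def classify_sequence(train_desc, train_labels, query_desc, k=3):
    """Plain kNN voting: every query frame votes for the labels of its
    k nearest training descriptors; the most-voted label wins."""
    train_desc = np.asarray(train_desc, dtype=float)
    train_labels = np.asarray(train_labels)
    classes = np.unique(train_labels)                # sorted unique labels
    votes = np.zeros(len(classes))
    for d in np.asarray(query_desc, dtype=float):
        dist = np.linalg.norm(train_desc - d, axis=1)
        for label in train_labels[np.argsort(dist)[:k]]:
            votes[np.searchsorted(classes, label)] += 1
    return classes[np.argmax(votes)]
```

Because each frame votes independently, a running tally of this kind can be read out before the sequence ends, which is the property that makes frame-level descriptors attractive for low-latency recognition.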