Regional Information Center for Science and Technology - Monocular viewpoint invariant human activity recognition

DocumentCode :

2251072

Title :

Monocular viewpoint invariant human activity recognition

Author :

Htike, Zaw Zaw ; Egerton, Simon ; Chow, Kuang Ye

Author_Institution :

Sch. of Inf. Technol., Monash Univ., Bandar Sunway, Malaysia

fYear :

2011

fDate :

17-19 Sept. 2011

Firstpage :

Lastpage :

Abstract :

One of the grand goals of robotics is to have assistive robots living side-by-side with humans, autonomously assisting them in everyday activities. To interact with humans and assist them, robots must be able to understand and interpret human activities. There is growing interest in the problem of human activity recognition. Despite much progress, most computer vision researchers have narrowed the problem to a fixed camera viewpoint, owing to the inherent difficulty of training their systems across all possible viewpoints. However, since robots and humans are free to move around in the environment, the viewpoint of a robot with respect to a person varies all the time. Therefore, we attempt to relax the infamous fixed viewpoint assumption and present a novel and efficient framework to recognize and classify human activities from a monocular video source captured from an arbitrary viewpoint. The proposed framework comprises two stages: human pose recognition and human activity recognition. In the pose recognition stage, an ensemble of pose models performs inference on each video frame. Each pose model estimates the probability that the given frame contains the corresponding pose. Over a sequence of frames, the outputs of each pose model form a time series. In the activity recognition stage, we use nearest neighbor classification, with dynamic time warping as the distance measure, to classify the pose time series. We have built a small-scale proof-of-concept model and performed experiments on three publicly available datasets. The experimental results are satisfactory; they demonstrate the efficacy of our framework and encourage us to further develop a full-scale architecture.
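
The activity recognition stage described in the abstract (nearest neighbor with dynamic time warping as the distance) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the per-frame outputs of the pose models are stacked into one probability vector per frame, that frames are compared with Euclidean distance, and that a plain 1-nearest-neighbor rule is used. The function names, the two-pose setup, the activity labels, and all numbers are hypothetical and chosen only for illustration.

```python
import math


def dtw_distance(a, b):
    """Dynamic time warping distance between two sequences of vectors.

    Each element of `a` and `b` is one frame: a tuple of pose probabilities
    (assumption: one entry per pose model, same length in both sequences).
    """
    n, m = len(a), len(b)
    # cost[i][j] = cheapest alignment of the first i frames of a
    # with the first j frames of b.
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean frame distance
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a repeated against b
                cost[i][j - 1],      # frame of b repeated against a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def classify_1nn(query, labelled_series):
    """Return the label of the training series closest to `query` under DTW."""
    label, _ = min(
        ((lab, dtw_distance(query, series)) for lab, series in labelled_series),
        key=lambda pair: pair[1],
    )
    return label


# Hypothetical training data: two pose models ("upright", "seated"),
# so each frame is (P(upright), P(seated)).
training = [
    ("walk", [(0.9, 0.1), (0.8, 0.2), (0.9, 0.1)]),
    ("sit", [(0.9, 0.1), (0.5, 0.5), (0.1, 0.9)]),
]

# A longer, slower query: DTW lets it align with the shorter "sit" example.
query = [(0.9, 0.1), (0.9, 0.1), (0.6, 0.4), (0.2, 0.8), (0.1, 0.9)]

print(classify_1nn(query, training))
```

Because dynamic time warping allows frames to be repeated during alignment, the five-frame query can match the three-frame "sit" example even though the two sequences differ in length and speed.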

Keywords :

cameras; human-robot interaction; image motion analysis; pose estimation; robot vision; time series; assistive robots; computer vision researchers; dynamic time warping; fixed camera viewpoint; human pose recognition; monocular video source; monocular viewpoint invariant human activity recognition; proof-of-concept model; Cameras; Hidden Markov models; Humans; Robots; Three dimensional displays; Time series analysis; Training

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Robotics, Automation and Mechatronics (RAM), 2011 IEEE Conference on

Conference_Location :

Qingdao

ISSN :

2158-2181

Print_ISBN :

978-1-61284-252-3

Type :

conf

DOI :

10.1109/RAMECH.2011.6070449

Filename :

6070449

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2251072