مرکز منطقه ای اطلاع رساني علوم و فناوري - Graphical framework for action recognition using temporally dense STIPs

DocumentCode :

3289669

Title :

Graphical framework for action recognition using temporally dense STIPs

Author :

Natarajan, Pradeep ; Banerjee, Prithviraj ; Khan, Furqan M. ; Nevatia, Ramakant

Author_Institution :

BBN Technol., Cambridge, MA, USA

fYear :

2009

fDate :

8-9 Dec. 2009

Firstpage :

Lastpage :

Abstract :

Graphical models have been shown to provide a natural framework for modelling high level action transition constraints, and to simultaneously segment and recognize a sequence of actions. Spatio-temporal interest points (STIPs) have been proposed as suitable features for action detection. These interest points are typically mapped to a set of codewords, and actions are detected by accumulating the codeword weights or by learning suitable classifiers. Existing methods for interest point detection provide a sparse representation of actions and require a costly exhaustive search over the entire spatio-temporal volume for action classification. Our contribution here is two-fold -first, we combine the interest point models of actions with pedestrian detection and tracking using a conditional random field (CRF); second, we extend existing interest point detectors to provide a dense action representation while minimizing spurious detections. The larger number of interest points and the high-level reasoning provided by the CRF allows us to automatically recognize action sequences from an unsegmented stream, at real time speed. We demonstrate our approach by showing results comparable to state-of-the-art for action classification on the standard KTH-action set, and also on more challenging cluttered videos.

Keywords :

image classification; image coding; image representation; image segmentation; image sequences; random processes; tracking; action classification; action detection; action representation; action sequence recognition; action sequence segmentation; codewords; conditional random field; graphical framework; high level action transition constraint modelling; high-level reasoning; interest point detection; pedestrian detection; pedestrian tracking; sparse representation; spatio-temporal interest points; Computer vision; Detectors; Feature extraction; Graphical models; Hidden Markov models; Intelligent robots; Intelligent systems; Layout; Videos; Yarn;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Motion and Video Computing, 2009. WMVC '09. Workshop on

Conference_Location :

Snowbird, UT

Print_ISBN :

978-1-4244-5500-3

Type :

conf

DOI :

10.1109/WMVC.2009.5399230

Filename :

5399230

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3289669