Regional Information Center for Science and Technology (RICeST) - A “string of feature graphs” model for recognition of complex activities in natural videos

DocumentCode :

2959510

Title :

A “string of feature graphs” model for recognition of complex activities in natural videos

Author :

Gaur, U. ; Zhu, Y. ; Song, B. ; Roy-Chowdhury, A.

Author_Institution :

Univ. of California, Riverside, CA, USA

fYear :

2011

fDate :

6-13 Nov. 2011

Firstpage :

2595

Lastpage :

2602

Abstract :

Videos usually consist of activities involving interactions between multiple actors, sometimes referred to as complex activities. Recognition of such activities requires modeling the spatio-temporal relationships between the actors and their individual variabilities. In this paper, we consider the problem of recognition of complex activities in a video given a query example. We propose a new feature model based on a string representation of the video which respects the spatio-temporal ordering. The characters in the string are local collections of features (e.g., cuboids, STIP) in ordered arrangement, and they are initially matched using graph-based spectral techniques. Final recognition is obtained by matching the string representations of the query and the test videos in a dynamic programming framework which allows for variability in sampling rates and speed of activity execution. The method does not require tracking or recognition of body parts, is able to identify the region of interest in a cluttered scene, and gives reasonable performance even with a single query example. We test our approach in an example-based video retrieval framework with two publicly available complex activity datasets and provide comparisons against other methods that have studied this problem.
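The string-matching step described in the abstract can be illustrated with a generic dynamic time warping (DTW) alignment. This is a minimal sketch only, not the authors' implementation: the record does not give the exact recurrence or the cost used in the paper. The function name `dtw_distance` and the `cost` callback are hypothetical; in the paper's setting `cost` would stand in for the graph-based spectral matching score between two feature graphs, which is not reproduced here.

```python
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def dtw_distance(
    query: Sequence[T],
    test: Sequence[T],
    cost: Callable[[T, T], float],
) -> float:
    """Align two strings of 'characters' with dynamic time warping.

    `cost(a, b)` is an assumed, user-supplied dissimilarity between two
    characters (here a placeholder for a feature-graph matching score).
    """
    n, m = len(query), len(test)
    inf = float("inf")
    # dp[i][j] = minimal cumulative cost of aligning query[:i] with test[:j]
    dp = [[inf] * (m + 1) for _ in range(n + 1)]
    dp[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            c = cost(query[i - 1], test[j - 1])
            # A character may match one-to-one or be stretched over several
            # characters of the other string, which absorbs differences in
            # sampling rate and execution speed.
            dp[i][j] = c + min(dp[i - 1][j - 1], dp[i - 1][j], dp[i][j - 1])
    return dp[n][m]


# Toy usage: integers stand in for feature graphs, absolute difference for
# the graph matching cost. The test string is the query at half speed.
slow = dtw_distance([1, 2, 3], [1, 1, 2, 2, 3, 3], lambda a, b: abs(a - b))
```

In this toy case the slowed-down sequence aligns at zero cost, which is the sense in which such an alignment tolerates variation in the speed of activity execution.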

Keywords :

dynamic programming; feature extraction; graph theory; image matching; image recognition; image representation; image sampling; video retrieval; complex activity dataset; complex activity recognition; example-based video retrieval; feature graph string; feature model; features collection; graph-based spectral technique; natural video; sampling rate; spatio-temporal ordering; spatio-temporal relationship; string representation; Clutter; Testing; Training; Vehicles; Videos

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Computer Vision (ICCV), 2011 IEEE International Conference on

Conference_Location :

Barcelona

ISSN :

1550-5499

Print_ISBN :

978-1-4577-1101-5

Type :

conf

DOI :

10.1109/ICCV.2011.6126548

Filename :

6126548

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2959510