مرکز منطقه ای اطلاع رساني علوم و فناوري - Boosted Exemplar Learning for Action Recognition and Annotation

DocumentCode :

1494359

Title :

Boosted Exemplar Learning for Action Recognition and Annotation

Author :

Zhang, Tianzhu ; Liu, Jing ; Liu, Si ; Xu, Changsheng ; Lu, Hanqing

Author_Institution :

Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China

Volume :

Issue :

fYear :

2011

fDate :

7/1/2011 12:00:00 AM

Firstpage :

853

Lastpage :

866

Abstract :

Human action recognition and annotation is an active research topic in computer vision. How to model various actions, varying with time resolution, visual appearance, and others, is a challenging task. In this paper, we propose a boosted exemplar learning (BEL) approach to model various actions in a weakly supervised manner, i.e., only action bag-level labels are provided but action instance level ones are not. The proposed BEL method can be summarized as three steps. First, for each action category, amount of class-specific candidate exemplars are learned through an optimization formulation considering their discrimination and co-occurrence. Second, each action bag is described as a set of similarities between its instances and candidate exemplars. Instead of simply using a heuristic distance measure, the similarities are decided by the exemplar-based classifiers through the multiple instance learning, in which a positive (or negative) video or image set is deemed as a positive (or negative) action bag and those frames similar to the given exemplar in Euclidean Space as action instances. Third, we formulate the selection of the most discriminative exemplars into a boosted feature selection framework and simultaneously obtain an action bag-based detector. Experimental results on two publicly available datasets: the KTH dataset and Weizmann dataset, demonstrate the validity and effectiveness of the proposed approach for action recognition. We also apply BEL to learn representations of actions by using images collected from the Web and use this knowledge to automatically annotate action in YouTube videos. Results are very impressive, which proves that the proposed algorithm is also practical in unconstraint environments.

Keywords :

computer vision; image classification; learning (artificial intelligence); optimisation; video signal processing; Euclidean space; KTH dataset; Weizmann dataset; World Wide Web; YouTube videos; action bag-based detector; boosted exemplar learning; boosted feature selection framework; computer vision; discriminative exemplar; exemplar-based classifier; human action annotation; human action recognition; instance learning; optimization formulation; Computational modeling; Histograms; Humans; Measurement; Semantics; Training; YouTube; Action annotation; AdaBoost; action recognition; mi-SVM; multiple instance learning (MIL);

fLanguage :

English

Journal_Title :

Circuits and Systems for Video Technology, IEEE Transactions on

Publisher :

ieee

ISSN :

1051-8215

Type :

jour

DOI :

10.1109/TCSVT.2011.2133090

Filename :

5750040

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1494359