DocumentCode :
3427346
Title :
Space-Time Robust Representation for Action Recognition
Author :
Ballas, Nicolas ; Yi Yang ; Zhen-Zhong Lan ; Delezoide, Bertrand ; Preteux, Francoise ; Hauptmann, Alexander
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
2704
Lastpage :
2711
Abstract :
We address the problem of action recognition in unconstrained videos. We propose a novel content driven pooling that leverages space-time context while being robust toward global space-time transformations. Being robust to such transformations is of primary importance in unconstrained videos where the action localizations can drastically shift between frames. Our pooling identifies regions of interest using video structural cues estimated by different saliency functions. To combine the different structural information, we introduce an iterative structure learning algorithm, WSVM (weighted SVM), that determines the optimal saliency layout of an action model through a sparse regularizer. A new optimization method is proposed to solve the WSVM´ highly non-smooth objective function. We evaluate our approach on standard action datasets (KTH, UCF50 and HMDB). Most noticeably, the accuracy of our algorithm reaches 51.8% on the challenging HMDB dataset which outperforms the state-of-the-art of 7.3% relatively.
Keywords :
image motion analysis; image recognition; image representation; iterative methods; learning (artificial intelligence); optimisation; support vector machines; video signal processing; HMDB dataset; WSVM; action datasets; action localizations; action recognition; content driven pooling; global space-time transformations; iterative structure learning algorithm; nonsmooth objective function; optimization method; saliency functions; space-time robust video representation; sparse regularizer; unconstrained videos; video structural cues; weighted SVM; Context; Encoding; Feature extraction; Motion segmentation; Robustness; Support vector machines; Trajectory; WSVM; action recognition; pooling; saliency; sparse regularization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.336
Filename :
6751447
Link To Document :
بازگشت