DocumentCode
3672351
Title
Dynamically encoded actions based on spacetime saliency
Author
Christoph Feichtenhofer;Axel Pinz;Richard P. Wildes
Author_Institution
Institute of Electrical Measurement and Measurement Signal Processing, TU Graz, Austria
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
2755
Lastpage
2764
Abstract
Human actions typically occur over a well-localized extent in both space and time. Similarly, as typically captured in video, human actions have small spatiotemporal support in image space. This paper capitalizes on these observations by weighting feature pooling for action recognition over those areas within a video where actions are most likely to occur. To enable this operation, we define a novel measure of spacetime saliency. The measure relies on two observations regarding foreground motion of human actors: they typically exhibit motion that contrasts with that of their surrounding region, and they are spatially compact. By using the resulting definition of saliency during feature pooling, we show that action recognition performance achieves state-of-the-art levels on three widely considered action recognition datasets. Our saliency-weighted pooling can be applied to essentially any locally defined features and encodings thereof. Additionally, we demonstrate that inclusion of locally aggregated spatiotemporal energy features, which efficiently result as a by-product of the saliency computation, further boosts performance over reliance on standard action recognition features alone.
Keywords
"Motion measurement","Energy measurement","Spatiotemporal phenomena","Encoding","Weight measurement","Support vector machines","Three-dimensional displays"
Publisher
IEEE
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2015.7298892
Filename
7298892