Title :
Recognizing actions using salient features
Author :
Wang, Liang ; Zhao, Debin
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
Towards a compact video feature representation, we propose a novel feature selection methodology for action recognition based on the saliency maps of videos. Since saliency maps measure the perceptual importance of the pixels and regions in videos, selecting features using saliency maps enables us to find a feature representation that covers the informative parts of a video. Because saliency detection is a bottom-up procedure, some appearance changes or motions that are irrelevant to actions may also be detected as salient regions. To further improve the purity of the feature representation, we prune these irrelevant salient regions using the saliency values distribution and the spatial-temporal distribution of the salient regions. Extensive experiments are conducted to demonstrate that the proposed feature selection method largely improves the performance of bag-of-video-words model on action recognition based on three different attention models including a static attention model, a motion attention model and their combination.
Keywords :
feature extraction; image recognition; image representation; bag-of-video-word model; feature selection methodology; pixels perceptual importance; saliency detection; saliency value distribution; spatial-temporal distribution; static attention model; video feature representation; video saliency map; Accuracy; Computational modeling; Detectors; Feature extraction; Histograms; Training; YouTube;
Conference_Titel :
Multimedia Signal Processing (MMSP), 2011 IEEE 13th International Workshop on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4577-1432-0
Electronic_ISBN :
978-1-4577-1433-7
DOI :
10.1109/MMSP.2011.6093832