DocumentCode :
1475395
Title :
Modeling Scene and Object Contexts for Human Action Retrieval With Few Examples
Author :
Jiang, Yu-Gang ; Li, Zhenguo ; Chang, Shih-Fu
Author_Institution :
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Volume :
21
Issue :
5
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
674
Lastpage :
681
Abstract :
The use of context knowledge is critical for understanding human actions, which typically occur under particular scene settings with certain object interactions. For instance, driving car usually happens outdoors, and kissing involves two people moving toward each other. In this paper, we investigate the problem of context modeling for human action retrieval. We first identify ten simple object-level action atoms relevant to many human actions, e.g., people getting closer. With the action atoms and several background scene classes, we show that action retrieval can be improved through modeling action-scene-object dependency. An algorithm inspired by the popular semi-supervised learning paradigm is introduced for this purpose. One important contribution of this paper is to show that modeling the dependencies among actions, objects, and scenes can be efficiently achieved with very few examples. Such a solution has tremendous potential in practice as it is often expensive to acquire large sets of training data. Experiments were performed on the challenging Hollywood2 dataset containing 89 movies. The results validate the effectiveness of our approach, achieving a mean average precision of 26% with just ten examples per action.
Keywords :
image retrieval; learning (artificial intelligence); object detection; object recognition; Hollywood2 dataset; action-scene-object dependency; context knowledge; context modeling; human action retrieval; object contexts; object-level action atoms; scene contexts; semisupervised learning; Context; Context modeling; Humans; Motion pictures; Training; Videos; Visualization; Action retrieval; context modeling; object and scene recognition; very few examples;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2011.2129870
Filename :
5734813
Link To Document :
بازگشت