Title :
Modeling mutual context of object and human pose in human-object interaction activities
Author :
Yao, Bangpeng ; Fei-Fei, Li
Author_Institution :
Comput. Sci. Dept., Stanford Univ., Stanford, CA, USA
Abstract :
Detecting objects in cluttered scenes and estimating articulated human body parts are two challenging problems in computer vision. The difficulty is particularly pronounced in activities involving human-object interactions (e.g. playing tennis), where the relevant object tends to be small or only partially visible, and the human body parts are often self-occluded. We observe, however, that objects and human poses can serve as mutual context to each other - recognizing one facilitates the recognition of the other. In this paper we propose a new random field model to encode the mutual context of objects and human poses in human-object interaction activities. We then cast the model learning task as a structure learning problem, of which the structural connectivity between the object, the overall human pose, and different body parts are estimated through a structure search approach, and the parameters of the model are estimated by a new max-margin algorithm. On a sports data set of six classes of human-object interactions, we show that our mutual context model significantly outperforms state-of-the-art in detecting very difficult objects and human poses.
Keywords :
computer vision; learning (artificial intelligence); object detection; optimisation; pose estimation; random processes; computer vision; human pose; human-object interaction activity; max-margin algorithm; model learning task; object detection; random field model; structure learning problem; structure search approach; Context modeling; Humans;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-6984-0
DOI :
10.1109/CVPR.2010.5540235