DocumentCode :
729728
Title :
Predicting image caption by a unified hierarchical model
Author :
Lin Bai ; Kan Li
Author_Institution :
Beijing Inst. of Technol., Beijing, China
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
Automatically describing the content of an image is a challenging task in artificial intelligence. The difficulty is particularly pronounced in activity recognition and the image caption revealed by the relationship analysis of the activities involved in the image. This paper presents a unified hierarchical model to model the interaction activity between human and nearby object, and then speculates the image content by analyzing the logical relationship among the interaction activities. In our model, the first-layer factored three-way interaction machine models the 3D spatial context between human and the relevant object to straightly aid the prediction of human-object interaction activities. Then, the activities are further processed through the top-layer factored three-way interaction machine to learn the image content with the help of 3D spatial context among the activities. Experiments on joint dataset show that our unified hierarchical model outperforms state-of-the-arts in predicting human-object interaction activities and describing the image caption.
Keywords :
image motion analysis; learning (artificial intelligence); activity recognition; artificial intelligence; human-object interaction activity; image caption; image content; three-way interaction machine model; unified hierarchical model; Computational modeling; Context; Context modeling; Mathematical model; Predictive models; Solid modeling; Three-dimensional displays; 3D spatial context; Factored three-way interaction; Human-object interaction activity; Image caption; Unified hierarchical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177427
Filename :
7177427
Link To Document :
بازگشت