DocumentCode :
3518103
Title :
What is happening in a still picture?
Author :
Li, Piji ; Ma, Jun
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear :
2011
fDate :
28-28 Nov. 2011
Firstpage :
32
Lastpage :
36
Abstract :
We consider the problem of generating concise sentences to describe still pictures automatically. We treat objects in images (nouns in sentences) as hidden information of actions (verbs). Therefore, the sentence generation problem can be transformed into action detection and scene classification problems. We employ Latent Multiple Kernel Learning (L-MKL) to learn the action detectors from “Exemplarlets”, and utilize MKL to learn the scene classifiers. The image features employed include distribution of edges, dense visual words and feature descriptors at different levels of spatial pyramid. For a new image we can detect the action using a sliding-window detector learnt via L-MKL, predict the scene the action happened in and build haction, scenei tuples. Finally, these tuples will be translated into concise sentences according to previously defined grammar template. We show both the classification and sentence generating results on our newly collected dataset of six actions as well as demonstrate improved performance over existing methods.
Keywords :
edge detection; feature extraction; image classification; learning (artificial intelligence); natural scenes; object detection; Exemplarlets; L-MKL; action detection; dense visual words; edge distribution; feature descriptors; hidden information; image features; latent multiple kernel learning; object detection; scene classification problem; sentence generation problem; sliding-window detector; spatial pyramid; still picture; Gold; Libraries; action classification; exemplarlets; multiple kernel learning; sentence generation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ACPR), 2011 First Asian Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0122-1
Type :
conf
DOI :
10.1109/ACPR.2011.6166555
Filename :
6166555
Link To Document :
بازگشت