DocumentCode
3518103
Title
What is happening in a still picture?
Author
Li, Piji ; Ma, Jun
Author_Institution
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear
2011
fDate
28-28 Nov. 2011
Firstpage
32
Lastpage
36
Abstract
We consider the problem of generating concise sentences to describe still pictures automatically. We treat objects in images (nouns in sentences) as hidden information of actions (verbs). Therefore, the sentence generation problem can be transformed into action detection and scene classification problems. We employ Latent Multiple Kernel Learning (L-MKL) to learn the action detectors from “Exemplarlets”, and utilize MKL to learn the scene classifiers. The image features employed include distribution of edges, dense visual words and feature descriptors at different levels of spatial pyramid. For a new image we can detect the action using a sliding-window detector learnt via L-MKL, predict the scene the action happened in and build haction, scenei tuples. Finally, these tuples will be translated into concise sentences according to previously defined grammar template. We show both the classification and sentence generating results on our newly collected dataset of six actions as well as demonstrate improved performance over existing methods.
Keywords
edge detection; feature extraction; image classification; learning (artificial intelligence); natural scenes; object detection; Exemplarlets; L-MKL; action detection; dense visual words; edge distribution; feature descriptors; hidden information; image features; latent multiple kernel learning; object detection; scene classification problem; sentence generation problem; sliding-window detector; spatial pyramid; still picture; Gold; Libraries; action classification; exemplarlets; multiple kernel learning; sentence generation;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ACPR), 2011 First Asian Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4577-0122-1
Type
conf
DOI
10.1109/ACPR.2011.6166555
Filename
6166555
Link To Document