• DocumentCode
    3518103
  • Title

    What is happening in a still picture?

  • Author

    Li, Piji ; Ma, Jun

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • fYear
    2011
  • fDate
    28-28 Nov. 2011
  • Firstpage
    32
  • Lastpage
    36
  • Abstract
    We consider the problem of generating concise sentences to describe still pictures automatically. We treat objects in images (nouns in sentences) as hidden information of actions (verbs). Therefore, the sentence generation problem can be transformed into action detection and scene classification problems. We employ Latent Multiple Kernel Learning (L-MKL) to learn the action detectors from “Exemplarlets”, and utilize MKL to learn the scene classifiers. The image features employed include distribution of edges, dense visual words and feature descriptors at different levels of spatial pyramid. For a new image we can detect the action using a sliding-window detector learnt via L-MKL, predict the scene the action happened in and build haction, scenei tuples. Finally, these tuples will be translated into concise sentences according to previously defined grammar template. We show both the classification and sentence generating results on our newly collected dataset of six actions as well as demonstrate improved performance over existing methods.
  • Keywords
    edge detection; feature extraction; image classification; learning (artificial intelligence); natural scenes; object detection; Exemplarlets; L-MKL; action detection; dense visual words; edge distribution; feature descriptors; hidden information; image features; latent multiple kernel learning; object detection; scene classification problem; sentence generation problem; sliding-window detector; spatial pyramid; still picture; Gold; Libraries; action classification; exemplarlets; multiple kernel learning; sentence generation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ACPR), 2011 First Asian Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4577-0122-1
  • Type

    conf

  • DOI
    10.1109/ACPR.2011.6166555
  • Filename
    6166555