• DocumentCode
    2717504
  • Title

    Automatic discovery of groups of objects for scene understanding

  • Author

    Li, Congcong ; Parikh, Devi ; Chen, Tsuhan

  • fYear
    2012
  • fDate
    16-21 June 2012
  • Firstpage
    2735
  • Lastpage
    2742
  • Abstract
    Objects in scenes interact with each other in complex ways. A key observation is that these interactions manifest themselves as predictable visual patterns in the image. Discovering and detecting these structured patterns is an important step towards deeper scene understanding. It goes beyond using either individual objects or the scene as a whole as the semantic unit. In this work, we promote "groups of objects". They are high-order composites of objects that demonstrate consistent spatial, scale, and viewpoint interactions with each other. These groups of objects are likely to correspond to a specific layout of the scene. They can thus provide cues for the scene category and can also prime the likely locations of other objects in the scene. It is not feasible to manually generate a list of all possible groupings of objects we find in our visual world. Hence, we propose an algorithm that automatically discovers groups of arbitrary numbers of participating objects from a collection of images labeled with object categories. Our approach builds a 4-dimensional transform space of location, scale and viewpoint, and efficiently identifies all recurring compositions of objects across images. We then model the discovered groups of objects using the deformable parts-based model. Our experiments on a variety of datasets show that using groups of objects can significantly boost the performance of object detection and scene categorization.
  • Keywords
    object detection; 4-dimensional transform space; automatic discovery; deformable parts-based model; groups-of-objects; image collection; object category; object composition; object detection; predictable visual pattern; scene categorization; scene category; scene layout; scene understanding; semantic unit; structured pattern detection; structured pattern discovery; visual world; Clustering algorithms; Deformable models; Detectors; Object detection; Training; Transforms; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
  • Conference_Location
    Providence, RI
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4673-1226-4
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2012.6247996
  • Filename
    6247996