DocumentCode
2717504
Title
Automatic discovery of groups of objects for scene understanding
Author
Li, Congcong ; Parikh, Devi ; Chen, Tsuhan
fYear
2012
fDate
16-21 June 2012
Firstpage
2735
Lastpage
2742
Abstract
Objects in scenes interact with each other in complex ways. A key observation is that these interactions manifest themselves as predictable visual patterns in the image. Discovering and detecting these structured patterns is an important step towards deeper scene understanding. It goes beyond using either individual objects or the scene as a whole as the semantic unit. In this work, we promote "groups of objects". They are high-order composites of objects that demonstrate consistent spatial, scale, and viewpoint interactions with each other. These groups of objects are likely to correspond to a specific layout of the scene. They can thus provide cues for the scene category and can also prime the likely locations of other objects in the scene. It is not feasible to manually generate a list of all possible groupings of objects we find in our visual world. Hence, we propose an algorithm that automatically discovers groups of arbitrary numbers of participating objects from a collection of images labeled with object categories. Our approach builds a 4-dimensional transform space of location, scale and viewpoint, and efficiently identifies all recurring compositions of objects across images. We then model the discovered groups of objects using the deformable parts-based model. Our experiments on a variety of datasets show that using groups of objects can significantly boost the performance of object detection and scene categorization.
Keywords
object detection; 4-dimensional transform space; automatic discovery; deformable parts-based model; groups-of-objects; image collection; object category; object composition; object detection; predictable visual pattern; scene categorization; scene category; scene layout; scene understanding; semantic unit; structured pattern detection; structured pattern discovery; visual world; Clustering algorithms; Deformable models; Detectors; Object detection; Training; Transforms; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location
Providence, RI
ISSN
1063-6919
Print_ISBN
978-1-4673-1226-4
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2012.6247996
Filename
6247996
Link To Document