DocumentCode :
2717504
Title :
Automatic discovery of groups of objects for scene understanding
Author :
Li, Congcong ; Parikh, Devi ; Chen, Tsuhan
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
2735
Lastpage :
2742
Abstract :
Objects in scenes interact with each other in complex ways. A key observation is that these interactions manifest themselves as predictable visual patterns in the image. Discovering and detecting these structured patterns is an important step towards deeper scene understanding. It goes beyond using either individual objects or the scene as a whole as the semantic unit. In this work, we promote "groups of objects". They are high-order composites of objects that demonstrate consistent spatial, scale, and viewpoint interactions with each other. These groups of objects are likely to correspond to a specific layout of the scene. They can thus provide cues for the scene category and can also prime the likely locations of other objects in the scene. It is not feasible to manually generate a list of all possible groupings of objects we find in our visual world. Hence, we propose an algorithm that automatically discovers groups of arbitrary numbers of participating objects from a collection of images labeled with object categories. Our approach builds a 4-dimensional transform space of location, scale and viewpoint, and efficiently identifies all recurring compositions of objects across images. We then model the discovered groups of objects using the deformable parts-based model. Our experiments on a variety of datasets show that using groups of objects can significantly boost the performance of object detection and scene categorization.
Keywords :
object detection; 4-dimensional transform space; automatic discovery; deformable parts-based model; groups-of-objects; image collection; object category; object composition; object detection; predictable visual pattern; scene categorization; scene category; scene layout; scene understanding; semantic unit; structured pattern detection; structured pattern discovery; visual world; Clustering algorithms; Deformable models; Detectors; Object detection; Training; Transforms; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6247996
Filename :
6247996
Link To Document :
بازگشت