مرکز منطقه ای اطلاع رساني علوم و فناوري - Automatic discovery of groups of objects for scene understanding

DocumentCode :

2717504

Title :

Automatic discovery of groups of objects for scene understanding

Author :

Li, Congcong ; Parikh, Devi ; Chen, Tsuhan

fYear :

2012

fDate :

16-21 June 2012

Firstpage :

2735

Lastpage :

2742

Abstract :

Objects in scenes interact with each other in complex ways. A key observation is that these interactions manifest themselves as predictable visual patterns in the image. Discovering and detecting these structured patterns is an important step towards deeper scene understanding. It goes beyond using either individual objects or the scene as a whole as the semantic unit. In this work, we promote "groups of objects". They are high-order composites of objects that demonstrate consistent spatial, scale, and viewpoint interactions with each other. These groups of objects are likely to correspond to a specific layout of the scene. They can thus provide cues for the scene category and can also prime the likely locations of other objects in the scene. It is not feasible to manually generate a list of all possible groupings of objects we find in our visual world. Hence, we propose an algorithm that automatically discovers groups of arbitrary numbers of participating objects from a collection of images labeled with object categories. Our approach builds a 4-dimensional transform space of location, scale and viewpoint, and efficiently identifies all recurring compositions of objects across images. We then model the discovered groups of objects using the deformable parts-based model. Our experiments on a variety of datasets show that using groups of objects can significantly boost the performance of object detection and scene categorization.

Keywords :

object detection; 4-dimensional transform space; automatic discovery; deformable parts-based model; groups-of-objects; image collection; object category; object composition; object detection; predictable visual pattern; scene categorization; scene category; scene layout; scene understanding; semantic unit; structured pattern detection; structured pattern discovery; visual world; Clustering algorithms; Deformable models; Detectors; Object detection; Training; Transforms; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location :

Providence, RI

ISSN :

1063-6919

Print_ISBN :

978-1-4673-1226-4

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2012.6247996

Filename :

6247996

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717504