مرکز منطقه ای اطلاع رساني علوم و فناوري - Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes

DocumentCode :

2459856

Title :

Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes

Author :

Cao, Liangliang ; Fei-Fei, Li

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL

fYear :

2007

fDate :

14-21 Oct. 2007

Firstpage :

Lastpage :

Abstract :

We present a novel generative model for simultaneously recognizing and segmenting object and scene classes. Our model is inspired by the traditional bag of words representation of texts and images as well as a number of related generative models, including probabilistic latent semantic analysis (pLSA) and latent Dirichlet allocation (LDA). A major drawback of the pLSA and LDA models is the assumption that each patch in the image is independently generated given its corresponding latent topic. While such representation provides an efficient computational method, it lacks the power to describe the visually coherent images and scenes. Instead, we propose a spatially coherent latent topic model (spatial-LTM). Spatial-LTM represents an image containing objects in a hierarchical way by over-segmented image regions of homogeneous appearances and the salient image patches within the regions. Only one single latent topic is assigned to the image patches within each region, enforcing the spatial coherency of the model. This idea gives rise to the following merits of spatial-LTM: (1) spatial-LTM provides a unified representation for spatially coherent bag of words topic models; (2) spatial-LTM can simultaneously segment and classify objects, even in the case of occlusion and multiple instances; and (3) spatial-LTM can be trained either unsupervised or supervised, as well as when partial object labels are provided. We verify the success of our model in a number of segmentation and classification experiments.

Keywords :

image classification; image representation; image segmentation; concurrent segmentation; generative models; image representation; latent dirichlet allocation; object classification; over-segmented image regions; probabilistic latent semantic analysis; salient image patches; scene classification; spatially coherent latent topic model; Computational efficiency; Computer vision; Detectors; Image recognition; Image segmentation; Layout; Linear discriminant analysis; Spatial coherence; Text analysis; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision, 2007. ICCV 2007. IEEE 11th International Conference on

Conference_Location :

Rio de Janeiro

ISSN :

1550-5499

Print_ISBN :

978-1-4244-1630-1

Electronic_ISBN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2007.4408965

Filename :

4408965

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2459856