DocumentCode :
3549027
Title :
A two level approach for scene recognition
Author :
Le Lu ; Toyama, K. ; Hager, Gregory D.
Author_Institution :
Dept. of Comput. Sci., Johns Hopkins Univ., Baltimore, MD, USA
Volume :
1
fYear :
2005
fDate :
20-25 June 2005
Firstpage :
688
Abstract :
Classifying pictures into one of several semantic categories is a classical image understanding problem. In this paper, we present a stratified approach to both binary (outdoor-indoor) and multiple category of scene classification. We first learn mixture models for 20 basic classes of local image content based on color and texture information. Once trained, these models are applied to a test image, and produce 20 probability density response maps (PDRM) indicating the likelihood that each image region was produced by each class. We then extract some very simple features from those PDRMs, and use them to train a bagged LDA classifier for 10 scene categories. For this process, no explicit region segmentation or spatial context model are computed. To test this classification system, we created a labeled database of 1500 photos taken under very different environment and lighting conditions, using different cameras, and from 43 persons over 5 years. The classification rate of outdoor-indoor classification is 93.8%, and the classification rate for 10 scene categories is 90.1%. As a byproduct, local image patches can be contextually labeled into the 20 basic material classes by using loopy belief propagation (Yedidia et al., 2001) as an anisotropic filter on PDRMs, producing an image-level segmentation if desired.
Keywords :
belief maintenance; feature extraction; image classification; image colour analysis; image segmentation; image texture; learning (artificial intelligence); probability; anisotropic filter; feature extraction; image understanding; image-level segmentation; loopy belief propagation; mixture model learning; picture classification; probability density response maps; scene classification; scene recognition; Belief propagation; Cameras; Context modeling; Data mining; Image databases; Image segmentation; Layout; Linear discriminant analysis; Spatial databases; System testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on
ISSN :
1063-6919
Print_ISBN :
0-7695-2372-2
Type :
conf
DOI :
10.1109/CVPR.2005.51
Filename :
1467335
Link To Document :
بازگشت