Title :
Learning reconfigurable scene representation by tangram model
Author :
Jun Zhu ; Wu, Tianfu ; Zhu, Jun ; Yang, Xiaokang ; Zhang, Wenjun
Author_Institution :
Inst. of Image Commun. & Inf. Process., Shanghai Jiao Tong Univ., Shanghai, China
Abstract :
This paper proposes a method to learn reconfigurable and sparse scene representation in the joint space of spatial configuration and appearance in a principled way. We call it the tangram model, which has three properties: (1) Unlike fixed structure of the spatial pyramid widely used in the literature, we propose a compositional shape dictionary organized in an And-Or directed acyclic graph (AOG) to quantize the space of spatial configurations. (2) The shape primitives (called tans) in the dictionary can be described by using any “off-the-shelf” appearance features according to different tasks. (3) A dynamic programming (DP) algorithm is utilized to learn the globally optimal parse tree in the joint space of spatial configuration and appearance. We demonstrate the tangram model in both a generative learning formulation and a discriminative matching kernel. In experiments, we show that the tangram model is capable of capturing meaningful spatial configurations as well as appearance for various scene categories, and achieves state-of-the-art classification performance on the LSP 15-class scene dataset and the MIT 67-class indoor scene dataset.
Keywords :
directed graphs; dynamic programming; image classification; image matching; image representation; learning (artificial intelligence); natural scenes; trees (mathematics); LSP 15-class scene dataset; MIT 67-class indoor scene dataset; and-or directed acyclic graph; compositional shape dictionary; discriminative matching kernel; dynamic programming algorithm; generative learning; off-the-shelf appearance features; optimal parse tree; reconfigurable sparse scene representation; spatial configuration; spatial pyramid; state-of-the-art classification; tangram model; Adaptation models; Dictionaries; Kernel; Lattices; Matching pursuit algorithms; Shape; Tiles;
Conference_Titel :
Applications of Computer Vision (WACV), 2012 IEEE Workshop on
Conference_Location :
Breckenridge, CO
Print_ISBN :
978-1-4673-0233-3
Electronic_ISBN :
1550-5790
DOI :
10.1109/WACV.2012.6163023