DocumentCode :
3425913
Title :
Codemaps - Segment, Classify and Search Objects Locally
Author :
Zhenyang Li ; Gavves, Efstratios ; van de Sande, Koen E. A. ; Snoek, Cees G. M. ; Smeulders, Arnold W. M.
Author_Institution :
Inf. Inst., Univ. of Amsterdam, Amsterdam, Netherlands
fYear :
2013
fDate :
1-8 Dec. 2013
Firstpage :
2136
Lastpage :
2143
Abstract :
In this paper we aim for segmentation and classification of objects. We propose codemaps that are a joint formulation of the classification score and the local neighborhood it belongs to in the image. We obtain the codemap by reordering the encoding, pooling and classification steps over lattice elements. Other than existing linear decompositions who emphasize only the efficiency benefits for localized search, we make three novel contributions. As a preliminary, we provide a theoretical generalization of the sufficient mathematical conditions under which image encodings and classification becomes locally decomposable. As first novelty we introduce l2 normalization for arbitrarily shaped image regions, which is fast enough for semantic segmentation using our Fisher codemaps. Second, using the same lattice across images, we propose kernel pooling which embeds nonlinearities into codemaps for object classification by explicit or approximate feature mappings. Results demonstrate that l2 normalized Fisher codemaps improve the state-of-the-art in semantic segmentation for PASCAL VOC. For object classification the addition of nonlinearities brings us on par with the state-of-the-art, but is 3x faster. Because of the codemaps´ inherent efficiency, we can reach significant speed-ups for localized search as well. We exploit the efficiency gain for our third novelty: object segment retrieval using a single query image only.
Keywords :
image classification; image coding; image retrieval; image segmentation; PASCAL VOC; arbitrarily-shaped image regions; classification score; classification step reordering; codemap inherent efficiency; efficiency gain; encoding step reordering; feature mapping; image encoding; kernel pooling; l2 normalization; l2 normalized Fisher codemaps; lattice elements; linear decomposition; local neighborhood; mathematical condition; object classification; object local search; object segment retrieval; object segmentation; pooling step reordering; semantic segmentation; single-query image; theoretical generalization; Encoding; Image coding; Image segmentation; Kernel; Lattices; Semantics; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2013.454
Filename :
6751376
Link To Document :
بازگشت