مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2717972

Title :

Towards compact topical descriptors

Author :

Ji, Rongrong ; Duan, Ling-Yu ; Chen, Jie ; Gao, Wen

Author_Institution :

Nat. Eng. Lab. for Video Technol., Peking Univ., Beijing, China

fYear :

2012

fDate :

16-21 June 2012

Firstpage :

2925

Lastpage :

2932

Abstract :

We introduce a Compact Topical Descriptor to learn a compact yet discriminative image signature from the reference image corpus. This descriptor is deployed over the well used bag-of-words image histogram, with two merits over the traditional topical features: First, we propose to directly control the topical sparsity to achieve the descriptor compactness. Second, we ensure the descriptor discriminability by minimizing the bag-of-words reconstruction errors during the topical histogram encoding. To this end, we have a generative viewpoint of the topical feature extraction, which is estimated as a sparse MAP estimation over the original bag-of-words. We learn such estimation by a bi-convex optimization, iterating between both hierarchical sparse coding from words to topical histograms and dictionary learning of the corresponding word-to-topic transform. Especially, supervised labels such as image ranking list can be also incorporated into our descriptor learning paradigm. We quantize our performance in both Im-ageNet 10K and NUS-WIDE, with comparisons to bag-of-words, LDA, miniBoF, and Aggregated Local Descriptors. In practice, we also implement our descriptor for a low bit rate mobile visual search application, i.e. sending compact descriptors instead of the image to reduce the query delivery latency. Our descriptor has significantly outperformed the state-of-the-art compact descriptors by quantitative evaluations over 10 million reference images.

Keywords :

convex programming; feature extraction; image coding; image reconstruction; ImageNet 10K; NUS-WIDE; bag-of-words image histogram; biconvex optimization; compact topical descriptors; discriminative image signature; hierarchical sparse coding; mobile visual search application; reconstruction errors; reference image corpus; sparse MAP estimation; topical feature extraction; topical sparsity; word-to-topic transform; Dictionaries; Equations; Histograms; Image coding; Mathematical model; Mobile communication; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location :

Providence, RI

ISSN :

1063-6919

Print_ISBN :

978-1-4673-1226-4

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2012.6248020

Filename :

6248020

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717972