Title :
Towards compact topical descriptors
Author :
Ji, Rongrong ; Duan, Ling-Yu ; Chen, Jie ; Gao, Wen
Author_Institution :
Nat. Eng. Lab. for Video Technol., Peking Univ., Beijing, China
Abstract :
We introduce a Compact Topical Descriptor to learn a compact yet discriminative image signature from the reference image corpus. This descriptor is deployed over the well used bag-of-words image histogram, with two merits over the traditional topical features: First, we propose to directly control the topical sparsity to achieve the descriptor compactness. Second, we ensure the descriptor discriminability by minimizing the bag-of-words reconstruction errors during the topical histogram encoding. To this end, we have a generative viewpoint of the topical feature extraction, which is estimated as a sparse MAP estimation over the original bag-of-words. We learn such estimation by a bi-convex optimization, iterating between both hierarchical sparse coding from words to topical histograms and dictionary learning of the corresponding word-to-topic transform. Especially, supervised labels such as image ranking list can be also incorporated into our descriptor learning paradigm. We quantize our performance in both Im-ageNet 10K and NUS-WIDE, with comparisons to bag-of-words, LDA, miniBoF, and Aggregated Local Descriptors. In practice, we also implement our descriptor for a low bit rate mobile visual search application, i.e. sending compact descriptors instead of the image to reduce the query delivery latency. Our descriptor has significantly outperformed the state-of-the-art compact descriptors by quantitative evaluations over 10 million reference images.
Keywords :
convex programming; feature extraction; image coding; image reconstruction; ImageNet 10K; NUS-WIDE; bag-of-words image histogram; biconvex optimization; compact topical descriptors; discriminative image signature; hierarchical sparse coding; mobile visual search application; reconstruction errors; reference image corpus; sparse MAP estimation; topical feature extraction; topical sparsity; word-to-topic transform; Dictionaries; Equations; Histograms; Image coding; Mathematical model; Mobile communication; Visualization;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
DOI :
10.1109/CVPR.2012.6248020