DocumentCode
3147155
Title
Learning multiple codebooks for low bit rate mobile visual search
Author
Jie Lin ; Ling-Yu Duan ; Jie Chen ; Rongrong Ji ; Siwei Luo ; Wen Gao
Author_Institution
Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
fYear
2012
fDate
25-30 March 2012
Firstpage
933
Lastpage
936
Abstract
Compressing a query image´s signature via vocabulary coding is an effective approach to low bit rate mobile visual search. State-of-the-art methods concentrate on offline learning a codebook from an initial large vocabulary. Over a large heterogeneous reference database, learning a single codebook may not suffice for maximally removing redundant codewords for vocabulary based compact descriptor. In this paper, we propose to learn multiple codebooks (m-Codebooks) for extremely compressing image signatures. A query-specific codebook (q-Codebook) is online generated at both client and server sides by adaptively weighting the off-line learned multiple codebooks. The q-Codebook is subsequently employed to quantize the query image for producing compact, discriminative, and scalable descriptors. As q-Codebook may be simultaneously generated at both sides, without transmitting the entire vocabulary, only small overhead (e.g. codebook ID and codeword 0/1 index) is incurred to reconstruct the query signature at the server end. To fulfill m-Codebooks and q-Codebook, we adopt a Bi-layer Sparse Coding method to learn the sparse relationships of codewords vs. codebooks as well as codebooks vs. query images via l1 regularization. Experiments on benchmarking datasets have demonstrated the extremely small descriptor´s supervior performance in image retrieval.
Keywords
image coding; image retrieval; learning (artificial intelligence); mobile computing; bilayer sparse coding; image retrieval; image signature compression; large heterogeneous reference database; low bit rate mobile visual search; m-Codebooks; multiple codebooks; offline learning; q-Codebook; query image signature; sparse relationships; vocabulary coding; Bit rate; Encoding; Image coding; Mobile communication; Quantization; Visualization; Vocabulary; Mobile visual search; compact descriptor; universal quantization; visual vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288038
Filename
6288038
Link To Document