• DocumentCode
    3147155
  • Title

    Learning multiple codebooks for low bit rate mobile visual search

  • Author

    Jie Lin ; Ling-Yu Duan ; Jie Chen ; Rongrong Ji ; Siwei Luo ; Wen Gao

  • Author_Institution
    Sch. of Comput. & Inf. Technol., Beijing Jiaotong Univ., Beijing, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    933
  • Lastpage
    936
  • Abstract
    Compressing a query image´s signature via vocabulary coding is an effective approach to low bit rate mobile visual search. State-of-the-art methods concentrate on offline learning a codebook from an initial large vocabulary. Over a large heterogeneous reference database, learning a single codebook may not suffice for maximally removing redundant codewords for vocabulary based compact descriptor. In this paper, we propose to learn multiple codebooks (m-Codebooks) for extremely compressing image signatures. A query-specific codebook (q-Codebook) is online generated at both client and server sides by adaptively weighting the off-line learned multiple codebooks. The q-Codebook is subsequently employed to quantize the query image for producing compact, discriminative, and scalable descriptors. As q-Codebook may be simultaneously generated at both sides, without transmitting the entire vocabulary, only small overhead (e.g. codebook ID and codeword 0/1 index) is incurred to reconstruct the query signature at the server end. To fulfill m-Codebooks and q-Codebook, we adopt a Bi-layer Sparse Coding method to learn the sparse relationships of codewords vs. codebooks as well as codebooks vs. query images via l1 regularization. Experiments on benchmarking datasets have demonstrated the extremely small descriptor´s supervior performance in image retrieval.
  • Keywords
    image coding; image retrieval; learning (artificial intelligence); mobile computing; bilayer sparse coding; image retrieval; image signature compression; large heterogeneous reference database; low bit rate mobile visual search; m-Codebooks; multiple codebooks; offline learning; q-Codebook; query image signature; sparse relationships; vocabulary coding; Bit rate; Encoding; Image coding; Mobile communication; Quantization; Visualization; Vocabulary; Mobile visual search; compact descriptor; universal quantization; visual vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288038
  • Filename
    6288038