• DocumentCode
    2025
  • Title

    Towards Mobile Document Image Retrieval for Digital Library

  • Author

    Ling-Yu Duan ; Rongrong Ji ; Zhang Chen ; Tiejun Huang ; Wen Gao

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
  • Volume
    16
  • Issue
    2
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    346
  • Lastpage
    359
  • Abstract
    With the proliferation of mobile devices, recent years have witnessed an emerging potential to integrate mobile visual search techniques into digital library. Such a mobile application scenario in digital library has posed significant and unique challenges in document image search. The mobile photograph makes it tough to extract discriminative features from the landmark regions of documents, like line drawings, as well as text layouts. In addition, both search scalability and query delivery latency remain challenging issues in mobile document search. The former relies on an effective yet memory-light indexing structure to accomplish fast online search, while the latter puts a bit budget constraint of query images over the wireless link. In this paper, we propose a novel mobile document image retrieval framework, consisting of a robust Local Inner-distance Shape Context (LISC) descriptor of line drawings, a Hamming distance KD-Tree for scalable and memory-light document indexing, as well as a JBIG2 based query compression scheme, together with a Retinex based enhancement and an OTSU based binarization, to reduce the latency of delivering query while maintaining query quality in terms of search performance. We have extensively validated the key techniques in this framework by quantitative comparison to alternative approaches.
  • Keywords
    digital libraries; document image processing; image retrieval; mobile computing; text analysis; trees (mathematics); JBIG2 based query compression scheme; OTSU based binarization; Retinex based enhancement; digital library; memory-light document indexing; mobile document image retrieval; mobile visual search techniques; query delivery latency; query images; robust local inner-distance shape context descriptor; search scalability; text layouts; wireless link; Context; Histograms; Image coding; Libraries; Mobile communication; Shape; Visualization; Digital library; Hamming space; JBIG2 compression; K-D tree; inner-distance; line drawing retrieval; mobile visual search; shape context;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2013.2293063
  • Filename
    6675889