• DocumentCode
    3486151
  • Title

    Modeling Local Word Spatial Configurations for Near Duplicate Document Image Retrieval

  • Author

    Li Liu ; Yue Lu ; Suen, Ching ; Jinhua Xu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    235
  • Lastpage
    239
  • Abstract
    The issue of near duplicate document image retrieval is addressed in this paper, which is characterized by not only encoding each individual word in the image but also modeling its local spatial configuration. On representing each word in the image as a string in terms of its shape characteristics, a lexicon is first learnt from a training set. Then a word in an arbitrary document image can be soft assigned to a weighted combination of several nearest neighbors in the lexicon. The rationale behind soft-assignment is to tolerate the distortions induced by character segmentations which are error-prone in degraded document images. Most importantly, we look beyond the single word and capture the local spatial configuration for each word which plays a very important role in human perception. It provides much useful information in discriminating between different document images compared with the single word. A graph, benefitting from its great representative power, is built for each word to model its relationships with the neighborhoods locally. The local word spatial configurations are integrated within the inverted file index structure to achieve scalable retrieval. Thus the retrieval of near duplicate document images is formulated as a voting problem. Experimental results on 45,000 document images demonstrate that the proposed approach brings significant improvements in successful retrieval of near duplicate images.
  • Keywords
    document image processing; graph theory; image representation; image retrieval; character segmentations; graph; human perception; image representation; inverted file index structure; local word spatial configuration modeling; near duplicate document image retrieval; shape characteristics; voting problem; Accuracy; Image retrieval; Image segmentation; Indexes; Noise; Shape; inverted file indexing; local word spatial configurations; near duplicate document image retrieval; word soft-assignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.54
  • Filename
    6628619