• DocumentCode
    410034
  • Title

    Chinese document image retrieval system based on proportion of black pixel area in a character image

  • Author

    Ching-Lin Wang ; Cher, T. ; Yung-Kuan Chan ; Ren-Hung Hwang ; Wan-Wen Huang

  • Author_Institution
    National Chung Cheng University
  • Volume
    1
  • fYear
    2004
  • fDate
    9-11 Feb. 2004
  • Firstpage
    25
  • Lastpage
    29
  • Abstract
    To preserve the original state of a document, it is usually scanned and saved on a computer in image format as backup data. Many retrieval systems have been proposed for such duplicate document images, but most are suitable only for English documents. This paper proposes a system for Chinese duplicate document images that uses the proportion of black pixel area in each character image as the feature of that character image. According to the experimental results, the proposed system can efficiently find the desired duplicate document image.
  • Keywords
    Character recognition; Computer science; Image databases; Image retrieval; Image segmentation; Information management; Information retrieval; Optical character recognition software; Pixel; Spatial databases; Dynamic Programming; LCS (Longest Common Subsequence); OCR (optical character recognition); character segmentation; document image; image matching;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advanced Communication Technology, 2004. The 6th International Conference on
  • Conference_Location
    Phoenix Park, Korea
  • Print_ISBN
    89-5519-119-7
  • Type

    conf

  • DOI
    10.1109/ICACT.2004.1292823
  • Filename
    1292823
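
The abstract names the feature (the proportion of black pixels in each character image), and the keywords mention LCS and dynamic programming. The following is a minimal sketch of how those two ideas might fit together; it is not the authors' implementation. The function names, the 0/1 bitmap representation, and the matching tolerance `tol` are all assumptions made for illustration.

```python
from typing import Sequence

# Assumed representation: a character image is a grid of 0/1 values,
# where 1 marks a black pixel and 0 a white pixel.
Bitmap = Sequence[Sequence[int]]


def black_ratio(char_img: Bitmap) -> float:
    """Return the fraction of pixels in the character image that are black."""
    total = sum(len(row) for row in char_img)
    if total == 0:
        return 0.0
    black = sum(1 for row in char_img for pixel in row if pixel)
    return black / total


def lcs_length(a: Sequence[float], b: Sequence[float], tol: float = 0.05) -> int:
    """Length of the longest common subsequence of two feature sequences.

    Two features are treated as matching when they differ by at most `tol`
    (a hypothetical threshold, not taken from the paper).
    """
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if abs(x - y) <= tol:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(query: Sequence[float], candidate: Sequence[float]) -> float:
    """Share of the query's characters matched in order by the candidate."""
    if not query:
        return 0.0
    return lcs_length(query, candidate) / len(query)
```

Under these assumptions, each document image would be reduced to a sequence of `black_ratio` values, one per segmented character, and candidate images would be ranked by `similarity` against the query sequence. Character segmentation itself is outside the scope of this sketch.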