• DocumentCode
    1938794
  • Title

    A Hierarchical Algorithm for Document-Images Fast Matching

  • Author

    Tan, Shuang ; Jia, Yan ; Pu, Yiguo ; Fan, Hua ; Wang, Tao

  • Author_Institution
    Comput. Sci., Nat. Univ. of Defense Technol. Changsha, Changsha, China
  • fYear
    2011
  • fDate
    5-7 Aug. 2011
  • Firstpage
    28
  • Lastpage
    33
  • Abstract
    As digital Libraries and document images largely use in the network, how to retrieve them quickly become one of the key issues. This paper presents a hierarchical matching algorithm to achieve fast retrieval of document images. Firstly, we can quickly find the possible matching location through approximate string matching algorithms, and then, use this location as a reference point in the target image, extract sub-block which has the same size as template image and compute the correlation coefficient between sub-block and template image. According to correlation coefficient values, we can accurately know whether the template image´s information exist in the target document, finally, experimental results demonstrate the feasibiLity of the algorithm.
  • Keywords
    correlation methods; digital libraries; document image processing; feature extraction; image matching; image retrieval; string matching; approximate string matching algorithm; correlation coefficient; digital library; document image fast matching; document image retrieval; hierarchical matching algorithm; subblock extraction; target image reference point; template image information; Algorithm design and analysis; Approximation algorithms; Character recognition; Correlation; Image matching; Optical character recognition software; Image Retrieval; approximate string matching; correlation coefficient; document images;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Manufacturing and Automation (ICDMA), 2011 Second International Conference on
  • Conference_Location
    Zhangjiajie, Hunan
  • Print_ISBN
    978-1-4577-0755-1
  • Electronic_ISBN
    978-0-7695-4455-7
  • Type

    conf

  • DOI
    10.1109/ICDMA.2011.16
  • Filename
    6051845