DocumentCode :
1638310
Title :
Segmentation-free Word Spotting in Historical Printed Documents
Author :
Gatos, B. ; Pratikakis, I.
Author_Institution :
Comput. Intell. Lab., Nat. Res. Center Demokritos, Athens, Greece
fYear :
2009
Firstpage :
271
Lastpage :
275
Abstract :
In this paper, a new efficient word spotting methodology is presented that can be applied to historical printed documents without requiring any previous block or word segmentation step. Our aim is a segmentation-free methodology, since for many historical documents the segmentation process does not produce meaningful results due to unconstrained layout, various degradations, or typesetting imperfections. The proposed method is based on block-based document image descriptors that are used in a template matching process that satisfies invariance with respect to translation, rotation and scaling. Processing time is reduced by applying the matching process only to salient regions of the image. Experimental results on a database of representative historical printed documents demonstrate the efficiency of the proposed approach.
Keywords :
document image processing; history; image matching; block-based document image descriptor; historical printed document image; image rotation; image scaling; image translation; segmentation-free word spotting; template matching process; typesetting imperfection; unconstraint layout; Computational intelligence; Degradation; Image databases; Image retrieval; Image segmentation; Informatics; Laboratories; Optical character recognition software; Text analysis; Typesetting; Historical Documents; Segmentation-free analysis; Word Spotting;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
ISSN :
1520-5363
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2009.236
Filename :
5277703