DocumentCode
2013228
Title
Retrieval of Handwritten Lines in Historical Documents
Author
Schomaker, L.R.B.
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
594
Lastpage
598
Abstract
This study describes methods for the retrieval of handwritten lines of text in a historical administrative collection. The goal is to develop generic methods for bootstrapping the retrieval system from a tabula rasa starting condition, i.e., the virtual absence of labeled samples. By exploiting the currently available computing power and the fact that computation takes place off line, it should be possible to provide a good starting point for statistical learning methods. In this manner, a closed collection can be incrementally indexed. A cross-correlation method on line-strip images is presented and results are compared to feature-based methods.
Keywords
administrative data processing; document image processing; feature extraction; handwriting recognition; information retrieval; learning (artificial intelligence); text analysis; bootstrapping; cross-correlation method; feature-based methods; handwritten line retrieval; handwritten text; historical administrative collection; historical documents; line-strip images; statistical learning; tabula rasa starting condition; Handwriting recognition; Humans; Image quality; Image retrieval; Image segmentation; Information retrieval; Labeling; Optical character recognition software; Strips; Text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4376984
Filename
4376984
Link To Document