Title :
Line detection and segmentation in historical church registers
Author :
Feldbach, Markus ; Tönnies, Klaus D.
Author_Institution :
Dept. of Simulation & Graphics, Otto-von-Guericke Univ. of Magdeburg, Germany
fDate :
6/23/1905 12:00:00 AM
Abstract :
For being able to automatically acquire the information recorded in church registers and other historical scriptures, the writing on these documents has to be recognized. This paper describes algorithms for transforming the paper documents into a representation of text apt to be used as input for an automatic text recognizer. The automatic recognition of old handwritten scriptures is difficult for two main reasons. Lines of text in general are not straight and ascenders and descenders of adjacent lines interfere. The algorithms described in this paper provide ways to reconstruct the path of the lines of text using an approach of gradually constructing line segments until a unique line of text is formed. In addition, the single lines are segmented and an output in form of a raster image is provided. The method was applied to church registers. They were written between the 17th and 19th Century. Line segmentation was found to be successful in 97% of all samples
Keywords :
document image processing; handwritten character recognition; history; image segmentation; optical character recognition; OCR; automatic text recognition; document image processing; handwritten scripture recognition; historical church registers; historical scriptures; line detection; line segmentation; paper documents; raster image; text representation; Computational modeling; Computer graphics; Computer simulation; Computer vision; Handwriting recognition; Image reconstruction; Image segmentation; Registers; Text recognition; Writing;
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953888