DocumentCode :
3497659
Title :
Separating lines of text in free-form handwritten historical documents
Author :
Kennard, Douglas J. ; Barrett, William A.
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT
fYear :
2006
fDate :
27-28 April 2006
Lastpage :
23
Abstract :
We present an approach to finding (and separating) lines of text in free-form handwritten historical document images. After preprocessing, our method uses the count of foreground/background transitions in a binarized image to determine areas of the document that are likely to be text lines. Alternatively, an adaptive local connectivity map (ALCM) found in the literature can be used for this step of the process. We then use a min-cut/max-flow graph cut algorithm to split up text areas that appear to encompass more than one line of text. After removing text lines containing relatively little text information (or merging them with nearby text lines), we create output images for each line. A grayscale output image is created, as well as a special mask image containing both the foreground and information flagging ambiguous pixels. Foreground pixels that belong to other text lines are removed from the output images to provide cleaner line images useful for further processing. While some refinement is still necessary, the result of early experimentation with our method is encouraging
Keywords :
document image processing; handwritten character recognition; history; image segmentation; adaptive local connectivity map; binarized image; foreground pixels; free-form handwritten historical document images; grayscale output image; information flagging ambiguous pixels; min-cut/max-flow graph cut algorithm; special mask image; text line separation; Background noise; Computer science; Degradation; Gray-scale; Handwriting recognition; Image recognition; Indexing; Merging; Pixel; Text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
Conference_Location :
Lyon
Print_ISBN :
0-7695-2531-8
Type :
conf
DOI :
10.1109/DIAL.2006.40
Filename :
1612942
Link To Document :
بازگشت