Title :
Separation of overlapping text from graphics
Author :
Cao, Ruini ; Tan, Chew Lim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore
fDate :
2001
Abstract :
The separation of overlapping text from graphics is a challenging problem in document image analysis. This paper proposes a method for detecting and extracting characters that touch graphics. It is based on the observation that the constituent strokes of characters are usually short segments compared with those of graphics. The method combines line continuation with line width as a feature to decompose and reconstruct the segments underlying the region of intersection. Experimental results show that the proposed method significantly improves both the percentage of correctly detected text and the accuracy of character recognition.
Keywords :
document image processing; image reconstruction; image segmentation; optical character recognition; character detection; character extraction; document image analysis; feature line width; intersection region; line continuation; overlapping text; segment decomposition; segment reconstruction; text-graphic separation; Character recognition; Engineering drawings; Filters; Graphics; Image analysis; Image reconstruction; Image segmentation; Roads; Solids; Text analysis;
Conference_Title :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953752