Title :
Processing of text documents: straight line approximation and lost loop recovery
Author :
Abuhaiba, I.S.I. ; Datta, S. ; Holt, M.J.J.
Author_Institution :
Dept. of Comput. Eng., King Saud Univ., Riyadh, Saudi Arabia
Abstract :
This paper deals with two different problems in processing of text documents. Firstly, an integrated algorithm that finds a straight line approximation of a textual stroke is described. It has the advantage of using the distance transform of thinned binary images to identify, spurious bifurcation points which are unavoidable when thinning algorithms are used, remove them, and recover the original ones. Secondly, a method is presented to recover loops that become blobs due to blotting. The method depends on removing the pixels whose distance transform exceeds a calculated threshold
Keywords :
document image processing; pattern recognition; distance transform; lost loop recovery; spurious bifurcation; straight line approximation; text documents; thinned binary images; thinning algorithms; Approximation algorithms; Bifurcation; Digital images; Head; Image segmentation; Ink; Joining processes; Radiofrequency interference; Skeleton; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.602127