Title :
Automated borders detection and adaptive segmentation for binary document images
Author :
Le, Daniel X. ; Thoma, George R. ; Wechsler, Harry
Author_Institution :
Nat. Libr. of Med., Bethesda, MD, USA
Abstract :
This paper describes two new and effective algorithms: one for detecting the page borders for documents available as binary images, and the other an adaptive segmentation algorithm using a bottom-up approach for segmenting binary images into blocks. The borders detection algorithm relies upon the classification of blank/textual/non-textual rows and columns, objects segmentation, and an analysis of projection profiles and crossing counts. Segmentation, done by an adaptive smearing technique, is different from all previous bottom-up approaches because any decisions on merging and/or separating are based on the estimated font information in binary document images
Keywords :
document image processing; feature extraction; image classification; image segmentation; adaptive segmentation; adaptive smearing technique; automated borders detection; binary document images; crossing counts; estimated font information; objects segmentation; page borders; projection profiles; Biomedical imaging; Books; Clustering algorithms; Computer science; Detection algorithms; Image converters; Image segmentation; Libraries; Merging; Text analysis;
Conference_Titel :
Pattern Recognition, 1996., Proceedings of the 13th International Conference on
Conference_Location :
Vienna
Print_ISBN :
0-8186-7282-X
DOI :
10.1109/ICPR.1996.547266