Title :
A background based adaptive page segmentation algorithm
Author :
Normand, N. ; Viard-Gaudin, C.
Author_Institution :
Lab. Syst. Electron. et Inf., IRESTE, Nantes, France
Abstract :
A novel page segmentation algorithm is provided in this paper. Based on the extraction of the background, it offers the benefit of being adaptive to the context of the document and to be insensitive to the orientation of the text blocks. It involves a two-dimensional isotropic structuring element used to characterized the white streams. This element is a disk approximated by a regular octagon which can be recursively generated. Another advantage of the proposed method is that a hierarchical segmentation can be derived from the image built upon the octagonal pattern. This tree allows to perform an isotropic multi-scale smearing, which leads to a physical segmentation. The algorithms are based on an input-time tracing principle and use a single scan of the image, they are very well suited to a real-time implementation
Keywords :
document image processing; feature extraction; image segmentation; optical character recognition; real-time systems; 2D isotropic structuring; background based adaptive page segmentation; background extraction; hierarchical segmentation; image scan; input-time tracing principle; isotropic multiscale smearing; octagonal pattern; optical character recognition; real-time implementation; regular octagon; text block orientation; tree; two-dimensional isotropic structuring; white streams; Character generation; Character recognition; Image coding; Image segmentation; Optical character recognition software; Optical filters; Real time systems; Shape; Smoothing methods; Streaming media;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.598961