Title :
A language for document generic layout description and its use for segmentation into regions
Author :
Azokly, Antoine ; Ingold, Rolf
Author_Institution :
Inst. of Inf., Fribourg Univ., Switzerland
Abstract :
We present a segmentation method guided by a generic layout description expressed in a new language. The proposed language allows to describe a page as superposed layers that may be used to separate the main text body from other components, for example figures. The language´s novelty resides in the fact that, instead of describing directly the global topology of generic pages according to their regions, generic separators are described and used as region boundary delimiters. Separators may be declared as white spaces or threads. By doing this, the problem of document segmentation into regions has become a problem of separator determination, solved by analyzing lines and white spaces contained in documents
Keywords :
document image processing; image segmentation; page description languages; document generic layout description language; generic layout description; region boundary delimiters; segmentation method; separator determination; Image analysis; Image recognition; Image segmentation; Layout; Page description languages; Particle separators; Strontium; Text analysis; White spaces; Writing;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.602117