Title :
Document image analysis for generating syntactic structure description
Author :
Tsuji, Yoshitake
Author_Institution :
NEC Corp., Kawasaki, Kanagawa, Japan
Abstract :
A document image analysis is described which automatically converts an input image into a syntactic document tree structure, while simultaneously representing the elements and their relative relations. Top-down image segmentation, using projection profiles, was greatly improved by systematically using a feedback process. As a result, the tree structure, including the blocks and their relative relations, was generated. Both the elements and their relations in the generated tree were finally determined by the bottom-up strategy, based on the general document layout property. Experimental results showed that this proposed method can be appropriately used to automatically describe an input image as a layout structure
Keywords :
character recognition; computerised pattern recognition; computerised picture processing; trees (mathematics); bottom-up strategy; computerised pattern recognition; computerised picture processing; document image analysis; layout structure; projection profiles; syntactic document tree structure; top down image segmentation; Books; Image analysis; Image converters; Image generation; Image segmentation; Image sequence analysis; Information technology; Master-slave; Text analysis; Tree data structures;
Conference_Titel :
Pattern Recognition, 1988., 9th International Conference on
Conference_Location :
Rome
Print_ISBN :
0-8186-0878-1
DOI :
10.1109/ICPR.1988.28346