Title :
Methodology for flexible and efficient analysis of the performance of page segmentation algorithms
Author :
Antonacopoulos, A. ; Brough, A.
Author_Institution :
Dept. of Comput. Sci., Liverpool Univ., UK
Abstract :
The paper presents part of a new DIA performance analysis framework aimed at layout analysis algorithm developers. A new region representation scheme (an interval based description of isothetic polygons) and a corresponding comparison approach are introduced. These enable fast and accurate geometric comparison of ground truth with results of page segmentation, improving on current evaluation methods. Complex layouts are accurately described and layout analysis methods that handle them can be effectively evaluated. A further benefit of the new approach is that it measures the accuracy of the description of regions, an issue which is important for complex layouts involving non text regions
Keywords :
computational geometry; document image processing; image segmentation; optical character recognition; DIA performance analysis framework; accurate geometric comparison; comparison approach; complex layouts; document image analysis; ground truth; interval based description; isothetic polygons; layout analysis algorithm developers; layout analysis methods; non text regions; page segmentation; page segmentation algorithm performance analysis; region representation scheme; Algorithm design and analysis; Computer science; Error analysis; Image analysis; Image segmentation; Large-scale systems; Optical character recognition software; Performance analysis; Spatial databases; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-7695-0318-7
DOI :
10.1109/ICDAR.1999.791822