DocumentCode :
2079452
Title :
Benchmarking page segmentation algorithms
Author :
Randriamasy, S. ; Vincent, L.
Author_Institution :
Robotics Lab., Harvard Univ., Cambridge, MA, USA
fYear :
1994
fDate :
21-23 Jun 1994
Firstpage :
411
Lastpage :
416
Abstract :
A method for automatically evaluating the quality of document page segmentation algorithms is introduced. Many different zoning techniques are now available but there is no robust method available to benchmark and evaluate them reliably. Our proposed strategy is a region-based approach, in which segmentation results are compared with manually generated “ground truth files”, describing all possible correct segmentations. A segmentation ground truthing scheme has been proposed. The evaluation of segmentation quality is achieved by testing the overlap between the two sets of regions. In fact, the regions are defined as the “black” pixels contained in the extracted polygons. An explicit specification of segmentation errors and a numerical evaluation are derived. The algorithm is simple and fast, and provides a multi-level output for each segmentation.
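The overlap test described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes axis-aligned rectangles in place of the extracted polygons, a single ground-truth segmentation rather than a file describing all correct ones, and a simple split/merge/missed labelling chosen here for illustration. All names (`black_pixels_in`, `overlap_table`, `classify`) are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Pixel = Tuple[int, int]            # (row, col)
Rect = Tuple[int, int, int, int]   # (top, left, bottom, right), inclusive


def black_pixels_in(image: List[List[int]], rect: Rect) -> Set[Pixel]:
    """Return the black (value 1) pixels of `image` that fall inside `rect`."""
    top, left, bottom, right = rect
    return {
        (r, c)
        for r in range(top, bottom + 1)
        for c in range(left, right + 1)
        if image[r][c] == 1
    }


def overlap_table(image: List[List[int]], truth: List[Rect],
                  result: List[Rect]) -> List[List[int]]:
    """Count shared black pixels for every (truth region, result region) pair."""
    truth_px = [black_pixels_in(image, t) for t in truth]
    result_px = [black_pixels_in(image, s) for s in result]
    return [[len(t & s) for s in result_px] for t in truth_px]


def classify(table: List[List[int]]) -> Dict[str, List[int]]:
    """Label errors from the overlap table (an assumed, simplified taxonomy)."""
    n_result = len(table[0]) if table else 0
    # A truth region covered by more than one result region was split.
    split = [i for i, row in enumerate(table) if sum(n > 0 for n in row) > 1]
    # A truth region with no overlapping result region was missed.
    missed = [i for i, row in enumerate(table) if all(n == 0 for n in row)]
    # A result region covering more than one truth region merged them.
    merged = [j for j in range(n_result)
              if sum(row[j] > 0 for row in table) > 1]
    return {"split": split, "missed": missed, "merged": merged}


# Toy page: two blocks on top, one full-width line at the bottom.
image = [
    [1, 1, 0, 0, 1, 1],
    [1, 1, 0, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
]
truth = [(0, 0, 1, 1), (0, 4, 1, 5), (3, 0, 3, 5)]
result = [(0, 0, 1, 5), (3, 0, 3, 2), (3, 3, 3, 5)]

table = overlap_table(image, truth, result)
report = classify(table)
print(table)
print(report)
```

Counting only black pixels means that two regions whose outlines overlap on white background alone are not treated as overlapping, which is the point of defining regions by their black pixels.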
Keywords :
image segmentation; optical character recognition; document page segmentation algorithms; extracted polygons; numerical evaluation; page segmentation algorithms; region-based approach; segmentation ground truthing scheme; zoning techniques
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on
Conference_Location :
Seattle, WA
ISSN :
1063-6919
Print_ISBN :
0-8186-5825-8
Type :
conf
DOI :
10.1109/CVPR.1994.323859
Filename :
323859