DocumentCode
401802
Title
Performance evaluation and benchmarking on document layout analysis algorithms
Author
Wu, Jin ; Pan, Wu-mo ; Jin, Jian-Ming ; Wang, Qing-ren
Author_Institution
Inst. of Machine Intelligence, Nankai Univ., Tianjin, China
Volume
4
fYear
2003
fDate
2-5 Nov. 2003
Firstpage
2246
Abstract
Researchers increasingly recognize the importance of building an objective and accurate benchmarking platform for document layout analysis (DLA) algorithms. This paper proposes an experimental benchmarking framework for DLA algorithms. First, a ground-truth database is built that stores test images together with the expected layout analysis results for each image. Second, the result a DLA algorithm produces for a given test image is evaluated against the expected results in the ground-truth database, and a performance metric is computed. Because the reasonable layout analysis result for a given image might not be unique, a flexible learning mechanism is incorporated into the framework to obtain a more accurate performance metric. Finally, the framework is applied to benchmark a document layout analysis algorithm, and experimental results are given.
Keywords
document image processing; performance evaluation; program testing; benchmarking; document layout analysis algorithm; flexible learning mechanism; ground-truth database; Algorithm design and analysis; Benchmark testing; Electronics packaging; Image analysis; Image databases; Image segmentation; Learning systems; Measurement; Performance analysis; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN
0-7803-8131-9
Type
conf
DOI
10.1109/ICMLC.2003.1259880
Filename
1259880