DocumentCode :
1580194
Title :
Why table ground-truthing is hard
Author :
Hu, Jianying ; Kashi, Ramanujan ; Lopresti, Daniel ; Nagy, George ; Wilfong, Gordon
Author_Institution :
Avaya Labs, Avaya Inc., Murray Hill, NJ, USA
fYear :
2001
fDate :
6/23/1905 12:00:00 AM
Firstpage :
129
Lastpage :
133
Abstract :
The principle that for every document analysis task there exists a mechanism for creating well-defined ground-truth is a widely held tenet. Past experience with standard datasets providing ground-truth for character recognition and page segmentation tasks supports this belief. In the process of attempting to evaluate several table recognition algorithms we have been developing, however, we have uncovered a number of serious hurdles connected with the ground-truthing of tables. This problem may, in fact, be much more difficult than it appears. We present a detailed analysis of why table ground-truthing is so hard, including the notions that there may exist more than one acceptable "truth" and/or incomplete or partial "truths"
Keywords :
document image processing; image recognition; image segmentation; character recognition; document analysis task; page segmentation; standard datasets; table ground-truthing; table recognition algorithms; CD-ROMs; Character recognition; Data structures; Ground support; Humans; Robustness; Standards development; Systems engineering and theory; Testing; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953768
Filename :
953768
Link To Document :
بازگشت