Title :
New Metrics for Evaluating Performance in Document Analysis Tasks_Application to the Table Case
Author :
Silva, Ana Costa E
Author_Institution :
Univ. of Edinburgh, Edinburgh
Abstract :
Is an algorithm capable of high precision and recall at classifying lines as part of table really good at locating tables? Several document analysis tasks require gluing or cutting certain document elements to form others. The suitability of the commonly used precision and recall for such division/aggregation tasks is arguable, since their underlying assumption is that the granularity of the items at input is the same as at output. We propose new evaluation metrics especially suited for this type of tasks, and show their application in several table tasks. In the process we present robust table location and cell segmentation algorithms.
Keywords :
document handling; pattern classification; aggregation tasks; cell segmentation algorithms; division tasks; document analysis; evaluation metrics; robust table location; Cost function; Databases; Error correction; Functional analysis; Image analysis; Information retrieval; Measurement units; Performance analysis; Robustness; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4378756