DocumentCode
2144743
Title
Iterative Analysis of Pages in Document Collections for Efficient User Interaction
Author
Chazalon, Joseph ; Coüasnon, Bertrand ; Lemaitre, Aurélie
Author_Institution
INSA Rennes, UEB, Rennes, France
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
503
Lastpage
507
Abstract
The analysis of sets of degraded documents, such as historical ones, is error-prone and requires human help to achieve acceptable quality levels. However, human interaction raises three main issues when processing large numbers of pages: neither the user nor the system should have to wait for work; information provided by a human operator should not be restricted to local, isolated corrections, but should instead produce durable changes in the system; and the ability to interact with a human operator should neither increase the complexity of document models nor duplicate them between the analysis and human interaction processes. To solve these issues, we propose an iterative approach, based on a special mechanism called visual memory, to reintegrate external information during page analysis. To demonstrate its relevance to existing systems, we explain how we adapted a rule-based page analysis tool to enable, within this iterative approach, delayed interaction with a human operator, based on an adaptation of error recovery principles for compilers and the well-known exception handling mechanism. We validated our iterative approach on sales registers from the 18th century.
Keywords
document handling; iterative methods; degraded documents; document collections; efficient user interaction; error recovery principles; human interaction processes; iterative analysis; page analysis; visual memory; Analytical models; Humans; Image recognition; Semantics; Text analysis; Visualization; document analysis; document sets; user interaction
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location
Beijing
ISSN
1520-5363
Print_ISBN
978-1-4577-1350-7
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2011.107
Filename
6065362