• DocumentCode
    2144743
  • Title

    Iterative Analysis of Pages in Document Collections for Efficient User Interaction

  • Author

    Chazalon, Joseph ; Coüasnon, Bertrand ; Lemaitre, Aurélie

  • Author_Institution
    INSA Rennes, UEB, Rennes, France
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    503
  • Lastpage
    507
  • Abstract
    The analysis of sets of degraded documents, like historical ones, is error-prone and requires human help to achieve acceptable quality levels. However, human interaction raises 3 main issues when processing important amounts of pages: none of the user or the system should wait for work, information provided by a human operator should not be restricted to local isolated corrections, but rather produce durable changes in the system, the ability to interact with a human operator should not increase the complexity of document models nor duplicate them between analysis and human interaction processes. To solve those issues, we propose an iterative approach, based on a special mechanism called visual memory, to reintegrate external information during page analysis. So as to demonstrate the interest for existing systems, we explain how we adapted a (rule-based) page analysis tool to enable, in this iterative approach, a delayed interaction with a human operator based on an adaptation of error recovery principles for compilers and the well-known exception handling mechanism. We validated our iterative approach on sales registers from the 18th century.
  • Keywords
    document handling; iterative methods; degraded documents; document collections; efficient user interaction; error recovery principles; human interaction processes; iterative analysis; page analysis; visual memory; Analytical models; Humans; Image recognition; Iterative methods; Semantics; Text analysis; Visualization; degraded documents; document analysis; document sets; iterative analysis; user interaction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.107
  • Filename
    6065362