• DocumentCode
    1580464
  • Title

    Automated discovery of dependencies between logical components in document image understanding

  • Author

    Malerba, Donato ; Esposito, Floriana ; Lisi, Francesca A. ; Altamura, Oronzo

  • Author_Institution
    Dipt. di Inf., Univ. degli Studi di Bari, Italy
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    174
  • Lastpage
    178
  • Abstract
    Document image understanding denotes the recognition of semantically relevant components in the layout extracted from a document image. This recognition process is based on some visual models, whose manual specification can be a highly demanding task. In order to automatically acquire these models, we propose the application of machine learning techniques. Problems raised by possible dependencies between concepts to be learned are illustrated and solved with a computational strategy based on the separate-and-parallel-conquer search. The approach is tested on a set of real multi-page documents processed by the system WISDOM++. New results confirm the validity of the proposed strategy and show some limits of the learning system used in this work
  • Keywords
    divide and conquer methods; document image processing; learning (artificial intelligence); optical character recognition; search problems; OCR; WISDOM system; computational strategy; document image recognition; document image understanding; logical component dependence discovery; machine learning; multi-page documents; separate-and-parallel-conquer search; visual models; Digital images; Image analysis; Image databases; Image recognition; Optical character recognition software; Optical devices; Publishing; System testing; Text analysis; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2001.953778
  • Filename
    953778