• DocumentCode
    2439211
  • Title

    A model-based form processing sub-system

  • Author

    Mao, Jianchang ; Abayan, Marlon ; Mohiuddin, K.

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    3
  • fYear
    1996
  • fDate
    25-29 Aug 1996
  • Firstpage
    691
  • Abstract
    This paper presents a model-based form processing sub-system, which consists of a form model database and five modules: (i) form modeling, (ii) form recognition, (iii) form dropout, (iv) form definition tool, and (v) form reconstruction. The form modeling module builds explicit representations of scanned form templates to facilitate form recognition and dropout. It can also assist a user to define various fields on a form. The automatic form recognition eliminates the need for manually sorting input forms. The form dropout module effectively removes pre-printed form content to achieve a high data compression rate and to provide clean data for OCR. Our model-driven form dropout scheme has two major advantages over image-based subtraction methods in both dropout efficiency and quality preservation of filled-in data
  • Keywords
    data compression; document image processing; image recognition; image reconstruction; data compression rate; form definition tool; form dropout; form model database; form recognition; form reconstruction; image-based subtraction methods; model-based form processing sub-system; model-driven form dropout scheme; pre-printed form content; Data compression; Government; Image databases; Image recognition; Image reconstruction; Image retrieval; Image storage; Information retrieval; Optical character recognition software; Sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1996., Proceedings of the 13th International Conference on
  • Conference_Location
    Vienna
  • ISSN
    1051-4651
  • Print_ISBN
    0-8186-7282-X
  • Type

    conf

  • DOI
    10.1109/ICPR.1996.547034
  • Filename
    547034