• DocumentCode
    3059174
  • Title

    Analysis of Error Sources Towards Improved Form Processing

  • Author

    Bhattacharya, Ujjwal ; Shaw, Bikash ; Parui, Swapan K.

  • Author_Institution
    Indian Stat. Inst., Kolkata
  • fYear
    2006
  • fDate
    18-21 Dec. 2006
  • Firstpage
    137
  • Lastpage
    138
  • Abstract
    Automatic form processing is an important application of document analysis subject. Such a system requires to be trained and tested on a standard database of forms collected from real-life. However, to the best of our knowledge, the only such available databases are NIST Special Databases. These databases consist of images of synthesized form documents. On the other hand, recently we developed a form database, samples of which had been taken from the real-life. ISIFormReader, a form processing system, also developed recently, has been tested using these real-life samples. An intensive study of the processing errors showed that writers´ idiosyncracies are one of the major reasons of such errors as analyzed in U. Bhattacharya, et al., (2006). In the present paper, we investigated various other sources of errors which together cause a major concern. These include sample forms which are low in contrast, noisy, smudgy, skewed, scaled disturbing its aspect ratio and so on. An analysis of errors due to similar such sources is important towards development of an improved form processing system.
  • Keywords
    document image processing; visual databases; NIST special database; automatic form processing; document analysis; error analysis; image database; Error analysis; Image databases; Ink; Intersymbol interference; NIST; Printers; Printing; Signal to noise ratio; System testing; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology, 2006. ICIT '06. 9th International Conference on
  • Conference_Location
    Bhubaneswar
  • Print_ISBN
    0-7695-2635-7
  • Type

    conf

  • DOI
    10.1109/ICIT.2006.30
  • Filename
    4273172