DocumentCode
3059174
Title
Analysis of Error Sources Towards Improved Form Processing
Author
Bhattacharya, Ujjwal ; Shaw, Bikash ; Parui, Swapan K.
Author_Institution
Indian Stat. Inst., Kolkata
fYear
2006
fDate
18-21 Dec. 2006
Firstpage
137
Lastpage
138
Abstract
Automatic form processing is an important application of document analysis. Such a system needs to be trained and tested on a standard database of forms collected from real life. However, to the best of our knowledge, the only such databases available are the NIST Special Databases, which consist of images of synthesized form documents. We recently developed a form database whose samples were taken from real life. ISIFormReader, a form processing system that was also developed recently, has been tested on these real-life samples. An intensive study of the processing errors showed that writers' idiosyncrasies are one of the major causes of such errors, as analyzed in U. Bhattacharya et al. (2006). In the present paper, we investigate various other sources of error which together are a major concern. These include sample forms that are low in contrast, noisy, smudged, skewed, or scaled in a way that distorts their aspect ratio, among others. An analysis of errors arising from such sources is important for the development of an improved form processing system.
Keywords
document image processing; visual databases; NIST special database; automatic form processing; document analysis; error analysis; image database; Error analysis; Image databases; Ink; Intersymbol interference; NIST; Printers; Printing; Signal to noise ratio; System testing; Text analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Technology, 2006. ICIT '06. 9th International Conference on
Conference_Location
Bhubaneswar
Print_ISBN
0-7695-2635-7
Type
conf
DOI
10.1109/ICIT.2006.30
Filename
4273172