DocumentCode :
3059174
Title :
Analysis of Error Sources Towards Improved Form Processing
Author :
Bhattacharya, Ujjwal ; Shaw, Bikash ; Parui, Swapan K.
Author_Institution :
Indian Stat. Inst., Kolkata
fYear :
2006
fDate :
18-21 Dec. 2006
Firstpage :
137
Lastpage :
138
Abstract :
Automatic form processing is an important application of document analysis. Such a system needs to be trained and tested on a standard database of forms collected from real life. However, to the best of our knowledge, the only such available databases are the NIST Special Databases. These databases consist of images of synthesized form documents. On the other hand, we recently developed a form database whose samples were taken from real life. ISIFormReader, a form processing system also developed recently, has been tested using these real-life samples. An intensive study of the processing errors showed that writers' idiosyncrasies are one of the major causes of such errors, as analyzed in U. Bhattacharya, et al., (2006). In the present paper, we investigate various other sources of errors which together cause a major concern. These include sample forms that are low in contrast, noisy, smudgy, skewed, scaled in a way that disturbs their aspect ratio, and so on. An analysis of errors due to such sources is important for the development of an improved form processing system.
Keywords :
document image processing; visual databases; NIST special database; automatic form processing; document analysis; error analysis; image database; Error analysis; Image databases; Ink; Intersymbol interference; NIST; Printers; Printing; Signal to noise ratio; System testing; Text analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology, 2006. ICIT '06. 9th International Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
0-7695-2635-7
Type :
conf
DOI :
10.1109/ICIT.2006.30
Filename :
4273172