DocumentCode :
3593741
Title :
How can document analysis help in capturing five million pages?
Author :
Suda, P. ; Maderlechner, G. ; Bock, H. ; Klunder, H.P.
Author_Institution :
Siemens AG, Munich, Germany
Volume :
1
fYear :
1995
Firstpage :
372
Abstract :
This paper describes how document analysis techniques like OCR, layout analysis, model based recognition and interpretation can be fruitfully applied in the field of high-volume, high-accuracy document capturing with very hard time constraints. We describe the way we set up a workflow that enables reliable capturing of real-estate registration documents. Techniques from document analysis are used to speed up the archiving process and to raise its quality. In particular an automatic determination of the positions for input of new text in partially filled text columns is described. This enables to bridge the gap between the non-coded archived documents and the coded information which is used to update the documents later
Keywords :
document handling; document image processing; optical character recognition; real estate data processing; OCR; document analysis; document capturing; layout analysis; model based interpretation; model based recognition; real-estate registration documents; Bridges; Hardware; Information management; Insurance; Optical character recognition software; Terminology; Text analysis; Text recognition; Time factors; Turning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Print_ISBN :
0-8186-7128-9
Type :
conf
DOI :
10.1109/ICDAR.1995.599016
Filename :
599016
Link To Document :
بازگشت