Title :
The processing of form documents
Author :
Doermann, David S. ; Rosenfeld, Azriel
Author_Institution :
Center for Autom. Res., Maryland Univ., College Park, MD, USA
Abstract :
An overview of an approach to the generic modeling and processing of known forms is presented. The system provides a methodology by which models are generated from regions in the document based on their usage. Automatic extraction of an optimal set of features to be used for registration is proposed, and it is shown how specialized detectors can be designed for each feature based on its position, orientation and width properties. Registration of the form with the model is accomplished using probing to establish correspondence. Form components that are corrupted by markings are detected and isolated, the intersections are interpreted, and the properties of the non-form markings are used to reconstruct the strokes through the intersections. The feasibility of these ideas is demonstrated through an implementation of key components of the system.
Keywords :
business forms; document handling; feature extraction; automatic feature extraction; form documents; generic modeling; known forms; model generation; non-form markings; optimal set; specialized detectors; stroke reconstruction; width properties; Context modeling; Data mining; Detectors; Educational institutions; Finance; Graphics; Information analysis; Office automation; Optical character recognition software; Process design
Conference_Title :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395687