Title :
Anatomy of a hand-filled form reader
Author :
Chhabra, Atul K.
Author_Institution :
NYNEX Sci. & Technol. Inc., White Plains, NY, USA
Abstract :
We describe a prototype generic form reader (GFR) system for reading hand-filled forms. The system can read run-on or touching handprinted characters. A one-time form specification is required for each type of form that the system is expected to read. The form specification includes the geometric locations of registration marks and fields of interest, field grammars, and system parameters. The GFR begins by detecting registration marks, computing image skew, extracting deskewed fields, and computing connected components in the field images. Next, the connected components are split into segments using heuristics about good splitting points. The system is liberal in splitting, i.e., a split segment may be part of a character or a complete character but, ideally, no more than one character. The segments are then adaptively regrouped into 'seg-groups' with the aid of a dynamic programming algorithm that matches the character answers for the seg-groups against the field grammar specification. The single character recognizer (SCR) uses high-order combinations of raw geometric features derived from segments and seg-groups. The high-order combining rules are derived by statistical discriminant analysis of the raw features. The GFR system provides some generic tools that can be applied to document image analysis problems other than forms reading.
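The regrouping step described in the abstract can be illustrated with a minimal Python sketch. It is not the GFR implementation. It assumes a field grammar reduced to a fixed string of character classes ("A" for a letter, "D" for a digit), and it replaces the single character recognizer with a hypothetical lookup table, CANDIDATES, that maps a group of segments to candidate characters and costs. The names regroup, recognize, in_class, and max_group are likewise illustrative assumptions.

```python
from math import inf

# Hypothetical stand-in for the single character recognizer (SCR):
# each seg-group (a tuple of segments) maps to (character, cost) candidates.
CANDIDATES = {
    ("c",): [("c", 0.2)],
    ("l",): [("l", 0.3), ("1", 0.3)],
    ("7",): [("7", 0.1)],
    ("c", "l"): [("d", 0.2)],  # two touching strokes that together look like "d"
}


def in_class(ch, cls):
    """Assumed grammar classes: "A" accepts letters, "D" accepts digits."""
    return ch.isalpha() if cls == "A" else ch.isdigit()


def recognize(group, cls):
    """Return (cost, char) for the cheapest candidate allowed by cls."""
    best = (inf, None)
    for ch, cost in CANDIDATES.get(tuple(group), []):
        if in_class(ch, cls) and cost < best[0]:
            best = (cost, ch)
    return best


def regroup(segments, grammar, max_group=3):
    """Group consecutive segments so the recognized string fits the grammar.

    cost[i][j] is the cheapest way to read the first i segments as the
    first j grammar positions; back[i][j] records the last group taken.
    Returns the recognized string, or None if no grouping fits.
    """
    n, m = len(segments), len(grammar)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            for k in range(1, min(max_group, i) + 1):
                prev = cost[i - k][j - 1]
                if prev == inf:
                    continue
                c, ch = recognize(segments[i - k:i], grammar[j - 1])
                if prev + c < cost[i][j]:
                    cost[i][j] = prev + c
                    back[i][j] = (k, ch)
    if cost[n][m] == inf:
        return None
    # Walk the back-pointers from the end to recover the characters.
    chars, i, j = [], n, m
    while j > 0:
        k, ch = back[i][j]
        chars.append(ch)
        i, j = i - k, j - 1
    return "".join(reversed(chars))


if __name__ == "__main__":
    print(regroup(["c", "l", "7"], "AD"))   # letter, digit
    print(regroup(["c", "l", "7"], "ADD"))  # letter, digit, digit
```

In this toy setting the same three segments are read as "d7" under a letter-digit grammar and as "c17" under a letter-digit-digit grammar, which shows how the field grammar, rather than the segmentation alone, decides the grouping.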
Keywords :
character recognition; dynamic programming; grammars; deskewed fields; document image analysis problems; dynamic programming algorithm; field grammars; form specification; generic tools; hand-filled form reader; handprinted characters; image skew; prototype generic form reader system; registration marks; single character recognizer; statistical discriminant analysis; system parameters; Anatomy; Character recognition; Dynamic programming; Heuristic algorithms; Image analysis; Image converters; Image segmentation; Prototypes; Text analysis; Thyristors;
Conference_Title :
Proceedings of the Second IEEE Workshop on Applications of Computer Vision, 1994
Conference_Location :
Sarasota, FL
Print_ISBN :
0-8186-6410-X
DOI :
10.1109/ACV.1994.341309