• DocumentCode
    2243185
  • Title

    Anatomy of a hand-filled form reader

  • Author

    Chhabra, Atul K.

  • Author_Institution
    NYNEX Sci. & Technol. Inc., White Plains, NY, USA
  • fYear
    1994
  • fDate
    5-7 Dec 1994
  • Firstpage
    195
  • Lastpage
    204
  • Abstract
    We describe a prototype generic form reader (GFR) system for reading hand-filled forms. The system can read run-on or touching handprinted characters. A one-time form specification is required for each type of form that the system is expected to read. The form specification includes geometric location of registration marks and fields of interest, field grammars, and system parameters. The GFR begins by detecting registration marks, computing image skew, extracting deskewed fields, and computing connected components in the field images. Next, the connected components are split into segments using heuristics about good splitting points. The system is liberal in splitting, i.e., a split segment could be a part of a character or a complete character, and hopefully no more than a character. Next, the segments are adaptively regrouped into `seg-groups´ with the aid of a dynamic programming algorithm that matches the character answers for the seg-groups with the field grammar specification. The single character recognizer (SCR) uses high order combinations of raw geometric features derived from segments and seg-groups. The high order combining rules are derived by statistical discriminant analysis of raw features. The GFR system provides some generic tools that can be applied to other document image analysis problems besides forms reading
  • Keywords
    character recognition; dynamic programming; grammars; deskewed fields; document image analysis problems; dynamic programming algorithm; field grammars; form specification; generic tools; hand-filled form reader; handprinted characters; image skew; prototype generic form reader system; registration marks; single character recognizer; statistical discriminant analysis; system parameters; Anatomy; Character recognition; Dynamic programming; Heuristic algorithms; Image analysis; Image converters; Image segmentation; Prototypes; Text analysis; Thyristors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Computer Vision, 1994., Proceedings of the Second IEEE Workshop on
  • Conference_Location
    Sarasota, FL
  • Print_ISBN
    0-8186-6410-X
  • Type

    conf

  • DOI
    10.1109/ACV.1994.341309
  • Filename
    341309