• DocumentCode
    2629825
  • Title

    A solution to the problem of touching and broken characters

  • Author

    Rocha, Jairo ; Pavlidis, Theo

  • Author_Institution
    Dept. of Comput. Sci., State Univ. of New York, Stony Brook, NY, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    602
  • Lastpage
    605
  • Abstract
    A segmentation-free approach to OCR is presented as part of a knowledge based word interpretation model. This method is based on the recognition of subgraphs homeomorphic to previously defined prototypes of characters. Gaps are identified as potential part of characters by implementing a variant of the notion of relative neighborhood used in computational perception. In the system, each subgraph of features that matches a previously defined character prototype is recognized anywhere in the word even if it corresponds to a broken character or to a character touching another one. Each subgraph that is recognized is introduced as a node in a direct net that compiles different alternatives of interpretation of the features in the feature graph. A final search for the optimal path under certain criteria gives the best interpretation of the word features
  • Keywords
    document image processing; expert systems; feature extraction; optical character recognition; OCR; broken characters; computational perception; feature extraction; feature graph; knowledge based word interpretation model; optimal path; relative neighborhood; segmentation-free approach; subgraph recognition; touching characters; Character recognition; Computer science; Feature extraction; Handwriting recognition; Image segmentation; Optical character recognition software; Pattern recognition; Prototypes; Shape; Usability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395663
  • Filename
    395663