Title :
A solution to the problem of touching and broken characters
Author :
Rocha, Jairo ; Pavlidis, Theo
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Stony Brook, NY, USA
Abstract :
A segmentation-free approach to OCR is presented as part of a knowledge-based word interpretation model. The method is based on recognizing subgraphs that are homeomorphic to previously defined character prototypes. Gaps are identified as potential parts of characters by implementing a variant of the notion of relative neighborhood used in computational perception. In the system, each subgraph of features that matches a previously defined character prototype is recognized anywhere in the word, even if it corresponds to a broken character or to a character touching another one. Each recognized subgraph is introduced as a node in a directed net that compiles the alternative interpretations of the features in the feature graph. A final search for the optimal path under certain criteria gives the best interpretation of the word features.
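The final step described in the abstract, a search for the optimal path through a net of recognized character candidates, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes, for simplicity, that the features are linearly ordered along the word, that each candidate covers a contiguous run of features, and that "optimal" means the highest summed score. The names `Hypothesis` and `best_interpretation`, and the scores themselves, are hypothetical.

```python
from typing import List, NamedTuple, Optional, Tuple


class Hypothesis(NamedTuple):
    """A recognized character candidate (hypothetical structure)."""
    start: int    # index of the first feature covered
    end: int      # index one past the last feature covered
    label: str    # character the matched prototype stands for
    score: float  # assumed matching score, higher is better


def best_interpretation(
    hypotheses: List[Hypothesis], n_features: int
) -> Optional[Tuple[float, List[str]]]:
    """Return (total score, labels) of the best chain of hypotheses that
    covers features 0..n_features-1 exactly once, or None if none exists."""
    # best[i] holds the best chain covering features [0, i), if any.
    best: List[Optional[Tuple[float, List[str]]]] = [None] * (n_features + 1)
    best[0] = (0.0, [])
    # Processing by end index guarantees best[h.start] is final when used,
    # because every hypothesis has start < end.
    for h in sorted(hypotheses, key=lambda h: h.end):
        if not (0 <= h.start < h.end <= n_features):
            raise ValueError(f"invalid span for hypothesis {h}")
        prev = best[h.start]
        if prev is None:
            continue  # no chain reaches the start of this hypothesis
        candidate = (prev[0] + h.score, prev[1] + [h.label])
        current = best[h.end]
        if current is None or candidate[0] > current[0]:
            best[h.end] = candidate
    return best[n_features]


if __name__ == "__main__":
    # Four features that could be read as a touching "r" + "n" or as one "m".
    candidates = [
        Hypothesis(0, 2, "r", 0.8),
        Hypothesis(2, 4, "n", 0.9),
        Hypothesis(0, 4, "m", 1.5),
    ]
    result = best_interpretation(candidates, 4)
    print("".join(result[1]) if result else "no interpretation")
```

With these made-up scores, the two-character reading wins because its summed score exceeds that of the single "m" candidate. In the system the abstract describes, the candidates are subgraphs of a feature graph rather than index ranges, so the adjacency test between candidates would be more involved than the `start`/`end` comparison used here.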
Keywords :
document image processing; expert systems; feature extraction; optical character recognition; OCR; broken characters; computational perception; feature graph; knowledge based word interpretation model; optimal path; relative neighborhood; segmentation-free approach; subgraph recognition; touching characters; Character recognition; Computer science; Feature extraction; Handwriting recognition; Image segmentation; Optical character recognition software; Pattern recognition; Prototypes; Shape; Usability
Conference_Title :
Proceedings of the Second International Conference on Document Analysis and Recognition, 1993
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395663