DocumentCode :
2835824
Title :
Segmentation-free word recognition with application to Arabic
Author :
Al-Badr, Badr ; Haralick, Robert M.
Author_Institution :
Intelligent Syst. Lab., Washington Univ., Seattle, WA, USA
Volume :
1
fYear :
1995
fDate :
14-16 Aug 1995
Firstpage :
355
Abstract :
This paper describes the design and implementation of a system that recognizes machine-printed Arabic words without prior segmentation. The technique is based on describing symbols in terms of shape primitives. At recognition time, the primitives are detected on a word image using mathematical morphology operations. The system then matches the detected primitives with symbol models. This leads to a spatial arrangement of matched symbol models. The system conducts a search in the space of spatial arrangements of models and outputs the arrangement with the highest posterior probability as the recognition of the word. The advantage of using this whole word approach versus a segmentation approach is that the result of recognition is optimized with regard to the whole word. Results of preliminary experiments using a lexicon of 42,000 words show a recognition rate of 99.4% for noise-free text and 73% for scanned text
Keywords :
character recognition; image recognition; mathematical morphology; design; implementation; lexicon; machine-printed Arabic words; matched symbol models; mathematical morphology; posterior probability; segmentation-free word recognition; shape primitives; spatial arrangement; spatial arrangements; symbols; word image; Character recognition; Image recognition; Image segmentation; Intelligent systems; Laboratories; Morphology; Noise shaping; Optical character recognition software; Shape; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
Type :
conf
DOI :
10.1109/ICDAR.1995.599012
Filename :
599012
Link To Document :
بازگشت