Title :
Design and comparison of segmentation driven and recognition driven Devanagari OCR
Author :
Kompalli, Suryaprakash ; Setlur, Srirangaraj ; Govindaraju, Venu
Author_Institution :
CEDAR, Buffalo Univ., Amherst, NY
Abstract :
We outline two different techniques for OCR of machine printed, multi-font Devanagari text. In the first design, words are segmented along linear boundaries. Subsequently, classification is performed with the assumption of accurate segmentation. The second approach uses classifiers to obtain preliminary hypotheses for each segment of the word. These results are used to guide further segmentation of certain pieces. While the former technique is segmentation driven, the latter method follows the paradigm of recognition driven segmentation. The two approaches are compared by using a standard data set.
Keywords :
optical character recognition; text analysis; word processing; machine printed Devanagari text; multifont Devanagari text; recognition driven Devanagari optical character recognition; segmentation driven Devanagari optical character recognition; word segmentation; Capacitive sensors; Character generation; Decision trees; Frequency; Image segmentation; Optical character recognition software; Speech recognition; Testing; Text analysis; Venus
Conference_Title :
Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
Conference_Location :
Lyon
Print_ISBN :
0-7695-2531-8
DOI :
10.1109/DIAL.2006.12