Title :
Candidate search and elimination approach for Telugu OCR
Author :
Negi, Atul ; Chereddi, Chandra Kanth
Author_Institution :
Dept. of Comput. & Inf. Sci., Hyderabad Univ., India
Abstract :
Telugu is one of the prominent scripts in India and Asia. We propose an OCR system for Telugu based on the candidate search and elimination technique. The initial candidates for recognition are found by applying a zoning method on input glyphs. We propose cavities as a structural approach suited specifically for Telugu script, where cavity vectors are used to prune the candidates found by zoning. A final template matching stage using controlled nonlinear normalization is performed to conclude the search process. The search can be concluded, at any stage, whenever a unique candidate is found. A recognition accuracy of 97-98% was achieved on real images scanned from Telugu literature.
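The staged search described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, tolerances, and toy 4x4 glyphs are hypothetical, `cavity_vector` is a simple stand-in rather than the paper's cavity definition, and the controlled nonlinear normalization step is omitted by assuming glyph and templates already share one size.

```python
from math import dist

def zoning_features(glyph, zones=2):
    """Fraction of ink pixels in each cell of a zones x zones grid."""
    h, w = len(glyph), len(glyph[0])
    feats = []
    for zr in range(zones):
        for zc in range(zones):
            rows = range(zr * h // zones, (zr + 1) * h // zones)
            cols = range(zc * w // zones, (zc + 1) * w // zones)
            ink = sum(glyph[r][c] for r in rows for c in cols)
            feats.append(ink / (len(rows) * len(cols)))
    return feats

def cavity_vector(glyph):
    """Stand-in cavity feature (assumption, not the paper's definition):
    counts background pixels enclosed by ink horizontally and vertically."""
    h, w = len(glyph), len(glyph[0])
    horiz = vert = 0
    for r in range(h):
        for c in range(w):
            if glyph[r][c]:
                continue
            if any(glyph[r][:c]) and any(glyph[r][c + 1:]):
                horiz += 1
            above = any(glyph[i][c] for i in range(r))
            below = any(glyph[i][c] for i in range(r + 1, h))
            if above and below:
                vert += 1
    return [horiz, vert]

def pixel_mismatch(a, b):
    """Number of differing pixels between two equally sized binary glyphs."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def recognize(glyph, templates, zone_tol=0.3, cavity_tol=2.5):
    """Prune candidates stage by stage; stop as soon as one (or none) remains."""
    candidates = list(templates)
    for feature, tol in ((zoning_features, zone_tol), (cavity_vector, cavity_tol)):
        target = feature(glyph)
        candidates = [name for name in candidates
                      if dist(feature(templates[name]), target) <= tol]
        if len(candidates) <= 1:
            # Unique candidate found (or everything eliminated): end the search.
            return candidates[0] if candidates else None
    # Several candidates survive: final template matching decides.
    return min(candidates, key=lambda name: pixel_mismatch(templates[name], glyph))

# Toy templates (hypothetical shapes, not Telugu glyphs).
TEMPLATES = {
    "A": [[1, 1, 1, 1], [1, 0, 0, 1], [1, 0, 0, 1], [1, 1, 1, 1]],
    "B": [[1, 0, 0, 0], [1, 0, 0, 0], [1, 0, 0, 0], [1, 1, 1, 1]],
    "C": [[1, 1, 1, 1], [1, 0, 0, 0], [1, 0, 0, 0], [1, 1, 1, 1]],
}
```

In this sketch a glyph that matches nothing within tolerance is rejected by returning `None`. Loosening the tolerances lets more candidates through to the template matching stage, which trades speed for robustness to noise.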
Keywords :
image matching; optical character recognition; search problems; Telugu OCR; candidate elimination; candidate search; cavity vectors; controlled nonlinear normalization; input glyphs; template matching; zoning method; Asia; Data mining; Euclidean distance; Image analysis; Image recognition; Natural languages; Optical character recognition software; Robustness; Speech recognition; Vectors;
Conference_Title :
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN :
0-7803-8162-9
DOI :
10.1109/TENCON.2003.1273278