DocumentCode
2834445
Title
Telugu script recognition-a feature based approach
Author
Rao, P.V.S. ; Ajitha, T.M.
Author_Institution
Comput. Syst. & Commun. Group, Tata Inst. of Fundamental Res., Bombay, India
Volume
1
fYear
1995
fDate
14-16 Aug 1995
Firstpage
323
Abstract
Telugu characters can be visualised as being composed of circular segments of different radii. Recognition consists in segmenting the characters into the component elements and identifying them. We choose a feature set to preserve the canonical shapes while filtering out as noise the shape deviations encountered in real life. Hence, this approach does not require extensive training. Instead, “Feature Vector” parameters for individual “basic” characters are extracted from single specimens written in isolation. These are suitably combined to construct “Feature Vectors” for compound characters for the lexicon. These are compared with similar “Feature Vectors” extracted from the test samples to be recognised. Recognition scores ranged from 78 to 90% across different subjects (when the best match alone is taken) and from 91 to 95% for a single subject
Keywords
character recognition; character sets; feature extraction; image segmentation; Telugu characters; canonical shapes; component elements; compound characters; feature set; feature vector parameters; lexicon; script recognition; segmenting; Books; Character recognition; Feature extraction; Noise shaping; Shape; Speech recognition; Testing; Visualization; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location
Montreal, Que.
Print_ISBN
0-8186-7128-9
Type
conf
DOI
10.1109/ICDAR.1995.599004
Filename
599004
Link To Document