Telugu script recognition-a feature based approach

Author

Rao, P.V.S. ; Ajitha, T.M.

Author_Institution

Comput. Syst. & Commun. Group, Tata Inst. of Fundamental Res., Bombay, India

Volume

1

fYear

1995

fDate

14-16 Aug 1995

Firstpage

323

Abstract

Telugu characters can be visualised as being composed of circular segments of different radii. Recognition consists in segmenting the characters into the component elements and identifying them. We choose a feature set to preserve the canonical shapes while filtering out as noise the shape deviations encountered in real life. Hence, this approach does not require extensive training. Instead, “Feature Vector” parameters for individual “basic” characters are extracted from single specimens written in isolation. These are suitably combined to construct “Feature Vectors” for compound characters for the lexicon. These are compared with similar “Feature Vectors” extracted from the test samples to be recognised. Recognition scores ranged from 78 to 90% across different subjects (when the best match alone is taken) and from 91 to 95% for a single subject

Keywords

character recognition; character sets; feature extraction; image segmentation; Telugu characters; canonical shapes; component elements; compound characters; feature set; feature vector parameters; lexicon; script recognition; segmenting; Books; Character recognition; Feature extraction; Noise shaping; Shape; Speech recognition; Testing; Visualization; Writing;

fLanguage

English

Publisher

ieee

Conference_Titel

Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on

Conference_Location

Montreal, Que.

Print_ISBN

0-8186-7128-9

Type

conf

DOI

10.1109/ICDAR.1995.599004

Filename

599004