Title :
Language independent optical character recognition for hand written text
Author :
Ali, Anjum ; Ahmad, Mahmood ; Rafiq, Nasir ; Akber, Javed ; Ahmad, Usman ; Akmal, Shahwar
Author_Institution :
Dept. of Comput. Sci. & Eng., University of Eng. & Technol., Lahore, Pakistan
Abstract :
This paper describes a novel technique for optical character recognition of handwritten text that uses the basic geometrical strokes contained in the alphabet of a language. It is observed that every character of a language can be represented as a set of connected basic geometrical strokes. Thus, if a ligature is broken into its strokes, the technique can recognize the characters it contains, even when the ligature contains more than one character, as in cursive scripts. Recognition is font independent; however, the technique can also recognize typed characters in standard fonts, so font-based character recognition is a special case of the proposed technique. The technique was implemented as a C#.NET application called LIOCR (Language Independent Optical Character Reader). The results obtained by applying LIOCR to 25 samples of handwritten text are also reported.
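The idea in the abstract, that a ligature can be treated as a sequence of basic strokes and split into characters, can be illustrated with a minimal sketch. This is not the LIOCR implementation (which the abstract says was written in C#.NET); the stroke labels, the `STROKE_DICT` table, and the `recognize_ligature` function are all hypothetical, and the sketch assumes strokes have already been extracted from the image and labelled in writing order.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical stroke dictionary: each character is a sequence of basic
# geometrical strokes. Labels and entries are illustrative only and are
# not taken from the paper.
STROKE_DICT: Dict[Tuple[str, ...], str] = {
    ("vline",): "l",
    ("circle",): "o",
    ("vline", "circle"): "b",
    ("circle", "vline"): "d",
}


def recognize_ligature(
    strokes: List[str],
    stroke_dict: Dict[Tuple[str, ...], str] = STROKE_DICT,
) -> Optional[List[str]]:
    """Split a ligature's stroke sequence into known characters.

    Tries the longest stroke pattern first at each position and backtracks
    if the remainder cannot be matched. Returns None when no complete
    segmentation exists.
    """
    max_len = max(len(pattern) for pattern in stroke_dict)
    memo: Dict[int, Optional[List[str]]] = {}

    def solve(start: int) -> Optional[List[str]]:
        if start == len(strokes):
            return []  # every stroke has been consumed
        if start in memo:
            return memo[start]
        result: Optional[List[str]] = None
        longest = min(max_len, len(strokes) - start)
        for size in range(longest, 0, -1):
            pattern = tuple(strokes[start:start + size])
            if pattern in stroke_dict:
                rest = solve(start + size)
                if rest is not None:
                    result = [stroke_dict[pattern]] + rest
                    break
        memo[start] = result
        return result

    return solve(0)


if __name__ == "__main__":
    # A ligature whose strokes cover two characters.
    print(recognize_ligature(["vline", "circle", "circle", "vline"]))
```

Because the lookup depends only on stroke sequences and not on glyph shapes of a particular font or script, swapping in a different dictionary would, under these assumptions, adapt the same routine to another alphabet. Ambiguous sequences are resolved here by preferring the longest pattern, which is a design choice of this sketch rather than something the abstract specifies.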
Keywords :
handwritten character recognition; image segmentation; optical character recognition; basic geometrical strokes; handwritten text; language independent optical character recognition; optical character reader; Character recognition; Data preprocessing; Dictionaries; Feature extraction; Geometrical optics; Graphics; Neural networks; Optical character recognition software; Text recognition; XML
Conference_Title :
Multitopic Conference, 2004. Proceedings of INMIC 2004. 8th International
Print_ISBN :
0-7803-8680-9
DOI :
10.1109/INMIC.2004.1492850