Title :
Lexicon-based offline recognition of Amharic words in unconstrained handwritten text
Author :
Assabie, Yaregal ; Bigun, Josef
Author_Institution :
Sch. of Inf. Sci., Halmstad Univ., Halmstad, Sweden
Abstract :
This paper describes an offline handwriting recognition system for Amharic words based on lexicon. The system computes direction fields of scanned handwritten documents, from which pseudo-characters are segmented. The pseudo-characters are organized based on their proximity and direction to form text lines. Words are then segmented by analyzing the relative gap between subsequent pseudo-characters in text lines. For each segmented word image, the structural characteristics of pseudo-characters are syntactically analyzed to predict a set of plausible characters forming the word. The most likelihood word is finally selected among candidates by matching against the lexicon. The system is tested by a database of unconstrained handwritten Amharic documents collected from various sources. The lexicon is prepared from words appearing in the collected database.
Keywords :
document image processing; handwriting recognition; handwritten character recognition; image matching; image segmentation; natural language processing; word processing; Amharic words; lexicon-based offline recognition; offline handwriting recognition system; pseudo-characters segmentation; scanned handwritten Amharic documents; unconstrained handwritten text; word image segmention; Character generation; Character recognition; Handwriting recognition; Hidden Markov models; Image analysis; Image segmentation; Information science; Natural languages; Pixel; Text recognition;
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761145