DocumentCode :
3489800
Title :
Lexicon Reduction Using Segment Descriptors for Arabic Handwriting Recognition
Author :
Parvez, Mohammad Tanvir ; Mahmoud, Sabri A.
Author_Institution :
Comput. Eng. Dept., Qassim Univ., Qassim, Saudi Arabia
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1265
Lastpage :
1269
Abstract :
This paper presents a robust lexicon reduction technique using segment descriptors for Arabic handwritten text. The method segments an Arabic word into graphemes and adaptively generates a descriptor of the presence/absence of dots in those segments. The segmentation algorithm is based on the characteristic of Arabic script, which indicates predictable segmentations of Arabic characters. This in turn results in novel canonical segment descriptors for the lexicon entries. These descriptors are then used for lexicon reduction using a matching algorithm adapted for Arabic handwriting. Unlike other methods, features based on segment descriptors are computable for both word images and lexicon entries. Experimental results are reported on IfN/ENIT database which compare favorably with other approaches for lexicon reduction.
Keywords :
handwriting recognition; image segmentation; natural language processing; text detection; Arabic characters; Arabic handwriting recognition; Arabic script; lexicon entries; matching algorithm; robust lexicon reduction technique; segment descriptors; word images; Accuracy; Databases; Educational institutions; Feature extraction; Handwriting recognition; Image segmentation; canonical descriptor; dot assignment; lexicon reduction; segment descriptor;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.256
Filename :
6628817
Link To Document :
بازگشت