DocumentCode
2832203
Title
Automatic segmentation for Arabic characters in handwriting documents
Author
Lawgali, A. ; Bouridane, A. ; Angelova, M. ; Ghassemlooy, Z.
Author_Institution
Sch. of Comput., Eng. & Inf. Sci., Northumbria Univ., Newcastle upon Tyne, UK
fYear
2011
fDate
11-14 Sept. 2011
Firstpage
3529
Lastpage
3532
Abstract
The cursive and ligature nature of the Arabic script make the segmentation of words into individual characters a difficult task. Despite attempts to apply methods for cursive Latin and other scripts to Arabic script, it is generally insufficient to segment the Arabic text. This paper proposes a new segmentation algorithm for the handwritten Arabic text and the main idea consists of segmenting the word into sub-words and then computing the baseline of each sub-word. Using the descenders of sub-words and the baseline, candidate points are then calculated using a vertical projection. The algorithm has been tested using 800 handwritten Arabic words taken from the IFN/ENIT database and a comparison made against some existing methods and promising results have been obtained.
Keywords
handwritten character recognition; image segmentation; natural language processing; visual databases; word processing; Arabic script; IFN/ENIT database; cursive Latin scripts; handwritten Arabic text; handwritten Arabic words; segmentation algorithm; segmentation mentation; subword segmentation; Conferences; Databases; Handwriting recognition; Image segmentation; Noise; Shape; Skeleton; Arabic character segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2011 18th IEEE International Conference on
Conference_Location
Brussels
ISSN
1522-4880
Print_ISBN
978-1-4577-1304-0
Electronic_ISBN
1522-4880
Type
conf
DOI
10.1109/ICIP.2011.6116476
Filename
6116476
Link To Document