• DocumentCode
    2832203
  • Title

    Automatic segmentation for Arabic characters in handwriting documents

  • Author

    Lawgali, A. ; Bouridane, A. ; Angelova, M. ; Ghassemlooy, Z.

  • Author_Institution
    Sch. of Comput., Eng. & Inf. Sci., Northumbria Univ., Newcastle upon Tyne, UK
  • fYear
    2011
  • fDate
    11-14 Sept. 2011
  • Firstpage
    3529
  • Lastpage
    3532
  • Abstract
    The cursive and ligature nature of the Arabic script make the segmentation of words into individual characters a difficult task. Despite attempts to apply methods for cursive Latin and other scripts to Arabic script, it is generally insufficient to segment the Arabic text. This paper proposes a new segmentation algorithm for the handwritten Arabic text and the main idea consists of segmenting the word into sub-words and then computing the baseline of each sub-word. Using the descenders of sub-words and the baseline, candidate points are then calculated using a vertical projection. The algorithm has been tested using 800 handwritten Arabic words taken from the IFN/ENIT database and a comparison made against some existing methods and promising results have been obtained.
  • Keywords
    handwritten character recognition; image segmentation; natural language processing; visual databases; word processing; Arabic script; IFN/ENIT database; cursive Latin scripts; handwritten Arabic text; handwritten Arabic words; segmentation algorithm; segmentation mentation; subword segmentation; Conferences; Databases; Handwriting recognition; Image segmentation; Noise; Shape; Skeleton; Arabic character segmentation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2011 18th IEEE International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4577-1304-0
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2011.6116476
  • Filename
    6116476