• DocumentCode
    3325686
  • Title

    Arabic script based character segmentation: A review

  • Author

    Naz, Sabiha ; Hayat, K. ; Razzak, Muhammad Imran ; Anwar, Muhammad Waqas ; Akbar, Habib

  • Author_Institution
    COMSATS Inst. of Inf. Technol., Abbottabad, Pakistan
  • fYear
    2013
  • fDate
    22-24 June 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Segmentation based Arabic script based languages character recognition has been a popular field of research for many years. The challenging nature of Arabic script recognition has attracted the attention of researchers from both industry and academic circles but these efforts have not achieved good results until now. Segmentation of Urdu script when written in Nasta´liq writing style is very difficult task due to the complexity of writing style as compare to Naskh writing style. Good segmentation is one of reasons for high accuracy. Character segmentation has been a critical phase of the OCR process. The higher recognition rates for isolated characters as compare to results of words or connected character well illustrate the importance of segmentation. Current study investigate the recent work for character segmentation and challenges for segmentation for Arabic script based languages.
  • Keywords
    image segmentation; natural languages; optical character recognition; Arabic script based languages; Arabic script recognition; Naskh writing style; Nasta´liq writing style; OCR process; Urdu script; character segmentation; optical character recognition; recognition rates; Accuracy; Character recognition; Complexity theory; Image segmentation; Optical character recognition software; Shape; Writing; Character Recognition; Nastaliq; Segmentation; Urdu OCR;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology (WCCIT), 2013 World Congress on
  • Conference_Location
    Sousse
  • Print_ISBN
    978-1-4799-0460-0
  • Type

    conf

  • DOI
    10.1109/WCCIT.2013.6618741
  • Filename
    6618741