• DocumentCode
    1633006
  • Title

    Improvements in BBN´s HMM-Based Offline Arabic Handwriting Recognition System

  • Author

    Saleem, Shirin ; Cao, Huaigu ; Subramanian, Krishna ; Kamali, Matin ; Prasad, Rohit ; Natarajan, Prem

  • Author_Institution
    BBN Technol., Cambridge, MA, USA
  • fYear
    2009
  • Firstpage
    773
  • Lastpage
    777
  • Abstract
    Offline handwriting recognition of free-flowing Arabic text is a challenging task due to the plethora of factors that contribute to the variability in the data. In this paper, we address some of these sources of variability, and present experimental results on a large corpus of handwritten documents. Specific techniques such as the application of context-dependent Hidden Markov Models (HMMs) for the cursive Arabic script, unsupervised adaptation to account for the stylistic variations across scribes, and image pre-processing to remove ruled-lines are explored. In particular, we proposed a novel integration of structural features in the HMM framework which exclusively results in a 9% relative improvement in performance. Overall, we demonstrate a relative reduction of 17% in word error rate over our baseline Arabic handwriting recognition system.
  • Keywords
    data analysis; handwriting recognition; hidden Markov models; image recognition; text analysis; free-flowing Arabic text; handwritten document; hidden Markov model; image pre-processing; offline Arabic handwriting recognition system; Handwriting recognition; Hidden Markov models; Image segmentation; Optical character recognition software; Shape; Testing; Text analysis; Text recognition; Vocabulary; Writing; Arabic; Hidden Markov Models; Offline Handwriting Recognition; Structural Features; Writer Adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.282
  • Filename
    5277506