DocumentCode
1633006
Title
Improvements in BBN´s HMM-Based Offline Arabic Handwriting Recognition System
Author
Saleem, Shirin ; Cao, Huaigu ; Subramanian, Krishna ; Kamali, Matin ; Prasad, Rohit ; Natarajan, Prem
Author_Institution
BBN Technol., Cambridge, MA, USA
fYear
2009
Firstpage
773
Lastpage
777
Abstract
Offline handwriting recognition of free-flowing Arabic text is a challenging task due to the plethora of factors that contribute to the variability in the data. In this paper, we address some of these sources of variability, and present experimental results on a large corpus of handwritten documents. Specific techniques such as the application of context-dependent Hidden Markov Models (HMMs) for the cursive Arabic script, unsupervised adaptation to account for the stylistic variations across scribes, and image pre-processing to remove ruled-lines are explored. In particular, we proposed a novel integration of structural features in the HMM framework which exclusively results in a 9% relative improvement in performance. Overall, we demonstrate a relative reduction of 17% in word error rate over our baseline Arabic handwriting recognition system.
Keywords
data analysis; handwriting recognition; hidden Markov models; image recognition; text analysis; free-flowing Arabic text; handwritten document; hidden Markov model; image pre-processing; offline Arabic handwriting recognition system; Handwriting recognition; Hidden Markov models; Image segmentation; Optical character recognition software; Shape; Testing; Text analysis; Text recognition; Vocabulary; Writing; Arabic; Hidden Markov Models; Offline Handwriting Recognition; Structural Features; Writer Adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location
Barcelona
ISSN
1520-5363
Print_ISBN
978-1-4244-4500-4
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2009.282
Filename
5277506
Link To Document