DocumentCode :
76311
Title :
Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields
Author :
Xiang-Dong Zhou ; Da-Han Wang ; Feng Tian ; Cheng-Lin Liu ; Nakagawa, Masaki
Author_Institution :
Beijing Key Lab. of Human-Comput. Interaction, Inst. of Software, Beijing, China
Volume :
35
Issue :
10
fYear :
2013
fDate :
Oct. 2013
Firstpage :
2413
Lastpage :
2426
Abstract :
This paper proposes a method for handwritten Chinese/Japanese text (character string) recognition based on semi-Markov conditional random fields (semi-CRFs). The high-order semi-CRF model is defined on a lattice containing all possible segmentation-recognition hypotheses of a string to elegantly fuse the scores of candidate character recognition and the compatibilities of geometric and linguistic contexts by representing them in the feature functions. Based on given models of character recognition and compatibilities, the fusion parameters are optimized by minimizing the negative log-likelihood loss with a margin term on a training string sample set. A forward-backward lattice pruning algorithm is proposed to reduce the computation in training when trigram language models are used, and beam search techniques are investigated to accelerate the decoding speed. We evaluate the performance of the proposed method on unconstrained online handwritten text lines of three databases. On the test sets of databases CASIA-OLHWDB (Chinese) and TUAT Kondate (Japanese), the character level correct rates are 95.20 and 95.44 percent, and the accurate rates are 94.54 and 94.55 percent, respectively. On the test set (online handwritten texts) of ICDAR 2011 Chinese handwriting recognition competition, the proposed method outperforms the best system in competition.
Keywords :
Markov processes; computational geometry; handwritten character recognition; image segmentation; natural language processing; search problems; CASIA-OLHWDB; TUAT Kondate; beam search techniques; character recognition; decoding speed; feature functions; forward-backward lattice pruning algorithm; geometric contexts; handwritten Chinese-Japanese text recognition; high-order semi-CRF model; linguistic contexts; negative log-likelihood loss; segmentation-recognition hypotheses; semiMarkov conditional random fields; training string sample set; trigram language models; Character recognition; Context; Context modeling; Handwriting recognition; Lattices; Text recognition; Training; Character string recognition; beam search; lattice pruning; semi-Markov conditional random field; Algorithms; Artificial Intelligence; Asian Continental Ancestry Group; Computer Simulation; Handwriting; Humans; Image Enhancement; Image Interpretation, Computer-Assisted; Language; Markov Chains; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2013.49
Filename :
6472237
Link To Document :
بازگشت