Title :
Handwritten Word Recognition Using Conditional Random Fields
Author :
Shetty, Sachin ; Srinivasan, H. ; Srihari, S.
Author_Institution :
Univ. at Buffalo, Buffalo
Abstract :
The paper describes a lexicon driven approach for word recognition on handwritten documents using conditional random fields (CRFs). CRFs are discriminative models and do not make any assumptions about the underlying data and hence are known to be superior to hidden Markov models (HMMs) for sequence labeling problems. For word recognition, the document is first segmented into word images using an existing neural network based algorithm. Each word image is then over segmented into a number of small segments such that the combination of segments forms character images. Segment(s) is/are labeled as characters with probability evaluated from the CRF model. The total probability of a word image representing an entry from the lexicon is computed using a dynamic programming algorithm which evaluates the optimal combination of segments.
Keywords :
document image processing; dynamic programming; handwritten character recognition; hidden Markov models; image representation; neural nets; character images; conditional random fields; discriminative models; dynamic programming; handwritten documents; handwritten word recognition; hidden Markov models; image segmentation; lexicon driven approach; neural network; sequence labeling problems; word image represention; word images; Character recognition; Computer science; Dynamic programming; Handwriting recognition; Hidden Markov models; Image recognition; Image segmentation; Labeling; Neural networks; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4377085