• DocumentCode
    3497697
  • Title

    Exploring the use of conditional random field models and HMMs for historical handwritten document recognition

  • Author

    Feng, Shaolei ; Manmatha, R. ; McCallum, Andrew

  • Author_Institution
    Center for Intelligent Inf. Retrieval, Univ. of Massachusetts, Amherst, MA
  • fYear
    2006
  • fDate
    27-28 April 2006
  • Lastpage
    37
  • Abstract
    In this paper we explore different approaches for improving the performance of dependency models on discrete features for handwriting recognition. Hidden Markov models have often been used for handwriting recognition. Conditional random fields (CRF´s) allow for more general dependencies and we investigate their use. We believe that this is the first attempt at apply CRF´s for handwriting recognition. We show that on the whole word recognition task, the CRF performs better than a HMM on a publicly available standard dataset of 20 pages of George Washington´s manuscripts. The scale space for the whole word recognition task is large - almost 1200 states. To make CRF computation tractable we use beam search to make inference more efficient using three different approaches. Better improvement can be obtained using the HMM by directly smoothing the discrete features using the collection frequencies. This shows the importance of smoothing and also indicates the difficulty of training CRF´s when large state spaces are involved
  • Keywords
    document image processing; handwriting recognition; hidden Markov models; George Washington manuscripts; HMM; conditional random field models; conditional random fields; handwriting recognition; hidden Markov models; historical handwritten document recognition; whole word recognition; Character recognition; Frequency; Handwriting recognition; Hidden Markov models; Information retrieval; Labeling; Libraries; Smoothing methods; State-space methods; Zoology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Image Analysis for Libraries, 2006. DIAL '06. Second International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    0-7695-2531-8
  • Type

    conf

  • DOI
    10.1109/DIAL.2006.19
  • Filename
    1612944