• DocumentCode
    1236921
  • Title

    A Novel Connectionist System for Unconstrained Handwriting Recognition

  • Author

    Graves, Alex ; Liwicki, Marcus ; Fernandez, S. ; Bertolami, Roman ; Bunke, Horst ; Schmidhuber, Jürgen

  • Author_Institution
    Inst. fur Inf., Tech. Univ. Munchen, Munich
  • Volume
    31
  • Issue
    5
  • fYear
    2009
  • fDate
    5/1/2009 12:00:00 AM
  • Firstpage
    855
  • Lastpage
    868
  • Abstract
    Recognizing lines of unconstrained handwritten text is a challenging task. The difficulty of segmenting cursive or overlapping characters, combined with the need to exploit surrounding context, has led to low recognition rates for even the best current recognizers. Most recent progress in the field has been made either through improved preprocessing or through advances in language modeling. Relatively little work has been done on the basic recognition algorithms. Indeed, most systems rely on the same hidden Markov models that have been used for decades in speech and handwriting recognition, despite their well-known shortcomings. This paper proposes an alternative approach based on a novel type of recurrent neural network, specifically designed for sequence labeling tasks where the data is hard to segment and contains long-range bidirectional interdependencies. In experiments on two large unconstrained handwriting databases, our approach achieves word recognition accuracies of 79.7 percent on online data and 74.1 percent on offline data, significantly outperforming a state-of-the-art HMM-based system. In addition, we demonstrate the network´s robustness to lexicon size, measure the individual influence of its hidden layers, and analyze its use of context. Last, we provide an in-depth discussion of the differences between the network and HMMs, suggesting reasons for the network´s superior performance.
  • Keywords
    handwriting recognition; handwritten character recognition; hidden Markov models; image segmentation; recurrent neural nets; connectionist system; hidden Markov models; language modeling; overlapping character segmentation; recurrent neural network; unconstrained handwriting databases; unconstrained handwriting text recognition; Connectionist temporal classification; Handwriting recognition; Long Short-Term Memory; Offline handwriting recognition; Online handwriting recognition; Recurrent neural networks; Unconstrained handwriting recognition; bidirectional long short-term memory; connectionist temporal classification; hidden Markov model.; offline handwriting; online handwriting; recurrent neural networks; Algorithms; Automatic Data Processing; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Models, Statistical; Pattern Recognition, Automated; Reading; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2008.137
  • Filename
    4531750