• DocumentCode
    3487730
  • Title

    High-Performance OCR for Printed English and Fraktur Using LSTM Networks

  • Author

    Breuel, Thomas M. ; Ul-Hasan, Adnan ; Al-Azawi, Mayce Ali ; Shafait, Faisal

  • Author_Institution
    Tech. Univ. of Kaiserslautern, Kaiserslautern, Germany
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    683
  • Lastpage
    687
  • Abstract
    Long Short-Term Memory (LSTM) networks have yielded excellent results on handwriting recognition. This paper describes an application of bidirectional LSTM networks to the problem of machine-printed Latin and Fraktur recognition. Latin and Fraktur recognition differs significantly from handwriting recognition in both the statistical properties of the data, as well as in the required, much higher levels of accuracy. Applications of LSTM networks to handwriting recognition use two-dimensional recurrent networks, since the exact position and baseline of handwritten characters is variable. In contrast, for printed OCR, we used a one-dimensional recurrent network combined with a novel algorithm for baseline and x-height normalization. A number of databases were used for training and testing, including the UW3 database, artificially generated and degraded Fraktur text and scanned pages from a book digitization project. The LSTM architecture achieved 0.6% character-level test-set error on English text. When the artificially degraded Fraktur data set is divided into training and test sets, the system achieves an error rate of 1.64%. On specific books printed in Fraktur (not part of the training set), the system achieves error rates of 0.15% (Fontane) and 1.47% (Ersch-Gruber). These recognition accuracies were found without using any language modelling or any other post-processing techniques.
  • Keywords
    handwriting recognition; natural language processing; optical character recognition; statistical analysis; text analysis; English text; Fraktur; Fraktur text; LSTM networks; UW3 database; book digitization project; handwriting recognition; handwritten characters; high-performance OCR; long short term memory networks; machine printed Fraktur recognition; machine printed Latin recognition; printed English; printed OCR; recurrent networks; scanned pages; statistical properties; Error analysis; Handwriting recognition; Hidden Markov models; Optical character recognition software; Recurrent neural networks; Training; LSTM Networks; OCR; RNN;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.140
  • Filename
    6628705