Title :
Crossing the lines: making optimal use of context in line-based Handwritten Text Recognition
Author :
J. Tanha;J.D. Does;K. Depuydt;J.A. Sánchez
Author_Institution :
Institute for Dutch Lexicology (INL), Matthias de Vrieshof 3, 2311 BZ Leiden, The Netherlands
Abstract :
Handwritten text recognition (HTR) is often carried out line by line, with each text line decoded independently. This approach is known to degrade recognition accuracy for words and characters close to line boundaries. The present study investigates this issue from the point of view of the language modeling component of the HTR system. Lack of linguistic context is an obvious candidate cause of the accuracy loss, but it is certainly not the only factor in play. We seek to clarify to what extent the problem can be influenced by the language modeling component of the system. We first discuss how to develop adapted language models that significantly improve HTR performance in general. We then focus on methods to improve accuracy at line boundaries. The final result is an efficient approach that significantly improves HTR accuracy without changing the basic HTR system setup.
Keywords :
"Hidden Markov models","Adaptation models","Accuracy","Image recognition"
Conference_Title :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
DOI :
10.1109/ICDAR.2015.7333903