Title :
Long-short term memory neural networks language modeling for handwriting recognition
Author :
Frinken, Volkmar ; Zamora-Martinez, Francisco ; Espana-Boquera, Salvador ; Castro-Bleda, Maria Jose ; Fischer, Anath ; Bunke, Horst
Author_Institution :
Centre de Visio per Computador, Univ. Autonoma de Bacelona, Bellaterra, Spain
Abstract :
Unconstrained handwritten text recognition systems maximize the combination of two separate probability scores. The first one is the observation probability that indicates how well the returned word sequence matches the input image. The second score is the probability that reflects how likely a word sequence is according to a language model. Current state-of-the-art recognition systems use statistical language models in form of bigram word probabilities. This paper proposes to model the target language by means of a recurrent neural network with long-short term memory cells. Because the network is recurrent, the considered context is not limited to a fixed size especially as the memory cells are designed to deal with long-term dependencies. In a set of experiments conducted on the IAM off-line database we show the superiority of the proposed language model over statistical n-gram models.
Keywords :
content-addressable storage; handwriting recognition; image matching; image sequences; probability; recurrent neural nets; simulation languages; statistical analysis; text analysis; word processing; IAM offline database; bigram word probability; image matching; long-short term memory cells; observation probability; probability score; recurrent neural network; statistical language model; statistical n-gram model; unconstrained handwritten text recognition; word sequence; Artificial neural networks; Context; Handwriting recognition; Probability; Testing; Training;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4