Title :
Fast and Robust Training of Recurrent Neural Networks for Offline Handwriting Recognition
Author :
Doetsch, Patrick ; Kozielski, Michal ; Ney, Hermann
Author_Institution :
Comput. Sci. Dept., RWTH Aachen Univ., Aachen, Germany
Abstract :
In this paper we demonstrate a modified topology for long short-term memory recurrent neural networks that controls the shape of the squashing functions in gating units. We further propose an efficient training framework based on a mini-batch training on sequence level combined with a sequence chunking approach. The framework is evaluated on publicly available data sets containing English and French handwriting by utilizing a GPU based implementation. Speedups of more than 3x are achieved in training recurrent neural network models which outperform state of the art recognition results.
Keywords :
handwriting recognition; learning (artificial intelligence); recurrent neural nets; English handwriting; French handwriting; GPU based implementation; gating units; long short-term memory recurrent neural networks; mini-batch training; modified topology; offline handwriting recognition; recurrent neural networks training; sequence chunking approach; sequence level; squashing functions; Databases; Graphics processing units; Handwriting recognition; Hidden Markov models; Logic gates; Recurrent neural networks; Training; GPU; batch-training; handwriting recognition; recurrent neural networks;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
Print_ISBN :
978-1-4799-4335-7
DOI :
10.1109/ICFHR.2014.54