Title :
Comparing natural and synthetic training data for off-line cursive handwriting recognition
Author :
Varga, Tamas ; Bunke, Horst
Author_Institution :
Institut fur Informatik und angewandte Mathematik, Bern Univ., Switzerland
Abstract :
In this paper, a perturbation model for the generation of synthetic textlines from existing cursively handwritten lines of text, produced by human writers, is presented. The goal of synthetic textline generation is to improve the performance of an off-line cursive handwriting recognition system by providing it with additional, synthetic training data. In earlier papers, it has been shown that it is possible to improve the recognition performance by using such synthetically expanded training sets. In this paper, we investigate the suitability of synthetically generated handwriting when enlarging the training set of a handwriting recognition system in a more rigorous way. In particular, the improvements achieved with synthetic training data are compared to those achieved by expanding the training set using natural, i.e. human written, textlines.
Keywords :
handwritten character recognition; hidden Markov models; learning (artificial intelligence); perturbation techniques; hidden Markov model; natural training data; offline cursive handwriting recognition; perturbation model; synthetic textlines generation; synthetic training data; text handwritten lines; Character recognition; Handwriting recognition; Hidden Markov models; Humans; Interpolation; Nonlinear distortion; Solid modeling; Text recognition; Thumb; Training data; hidden Markov model (HMM).; off-line cursive handwriting recognition; perturbation model; synthetic training data; training set expansion;
Conference_Titel :
Frontiers in Handwriting Recognition, 2004. IWFHR-9 2004. Ninth International Workshop on
Print_ISBN :
0-7695-2187-8
DOI :
10.1109/IWFHR.2004.29