DocumentCode :
1056035
Title :
Using generative models for handwritten digit recognition
Author :
Revow, Michael ; Williams, Christopher K I ; Hinton, Geoffrey E.
Author_Institution :
Dept. of Comput. Sci., Toronto Univ., Ont., Canada
Volume :
18
Issue :
6
fYear :
1996
fDate :
6/1/1996 12:00:00 AM
Firstpage :
592
Lastpage :
606
Abstract :
We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian “ink generators” spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the expectation maximization algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages: 1) the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style; 2) the generative models can perform recognition driven segmentation; 3) the method involves a relatively small number of parameters and hence training is relatively easy and fast; and 4) unlike many other recognition schemes, it does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated that our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is that it requires much more computation than more standard OCR techniques
Keywords :
computer vision; image matching; image segmentation; learning (artificial intelligence); optical character recognition; optimisation; probability; splines (mathematics); Gaussian ink generators; deformable B-splines; deformable model; elastic matching; elastic net; expectation maximization; generative models; handwritten digit recognition; image rotation; learning; probabilistic model; scalings; segmentation; writing style; Character recognition; Computer vision; Deformable models; Handwriting recognition; Image generation; Image recognition; Image segmentation; Ink; Optical character recognition software; Optical noise;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.506410
Filename :
506410
Link To Document :
بازگشت