DocumentCode :
1440853
Title :
Gradient-based learning applied to document recognition
Author :
LÉcun, Yann ; Bottou, Leon ; Bengio, Yoshua ; Haffner, Patrick
Author_Institution :
Speech & Image Process. Services Lab., AT&T Bell Labs., Red Bank, NJ, USA
Volume :
86
Issue :
11
fYear :
1998
fDate :
11/1/1998 12:00:00 AM
Firstpage :
2278
Lastpage :
2324
Abstract :
Multilayer neural networks trained with the back-propagation algorithm constitute the best example of a successful gradient based learning technique. Given an appropriate network architecture, gradient-based learning algorithms can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters, with minimal preprocessing. This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task. Convolutional neural networks, which are specifically designed to deal with the variability of 2D shapes, are shown to outperform all other techniques. Real-life document recognition systems are composed of multiple modules including field extraction, segmentation recognition, and language modeling. A new learning paradigm, called graph transformer networks (GTN), allows such multimodule systems to be trained globally using gradient-based methods so as to minimize an overall performance measure. Two systems for online handwriting recognition are described. Experiments demonstrate the advantage of global training, and the flexibility of graph transformer networks. A graph transformer network for reading a bank cheque is also described. It uses convolutional neural network character recognizers combined with global training techniques to provide record accuracy on business and personal cheques. It is deployed commercially and reads several million cheques per day
Keywords :
backpropagation; convolution; multilayer perceptrons; optical character recognition; 2D shape variability; GTN; back-propagation; cheque reading; complex decision surface synthesis; convolutional neural network character recognizers; document recognition; document recognition systems; field extraction; gradient based learning technique; gradient-based learning; graph transformer networks; handwritten character recognition; handwritten digit recognition task; high-dimensional patterns; language modeling; multilayer neural networks; multimodule systems; performance measure minimization; segmentation recognition; Character recognition; Feature extraction; Hidden Markov models; Machine learning; Multi-layer neural network; Neural networks; Optical character recognition software; Optical computing; Pattern recognition; Principal component analysis;
fLanguage :
English
Journal_Title :
Proceedings of the IEEE
Publisher :
ieee
ISSN :
0018-9219
Type :
jour
DOI :
10.1109/5.726791
Filename :
726791
Link To Document :
بازگشت