Title :
Dynamic planar warping for optical character recognition
Author :
Levin, Esther ; Pieraccini, Roberto
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
The authors extend the dynamic time warping (DTW) algorithm, widely used in automatic speech recognition (ASR), to a dynamic plane warping (DPW) algorithm, for application in the field of optical character recognition (OCR) or similar applications. Although direct application of the optimality principle reduced the computational complexity somewhat, the DPW (or image alignment) problem is exponential in the dimensions of the image. It is shown that by applying constraints to the image alignment problem, e.g., limiting the class of possible distortions, one can reduce the computational complexity dramatically, and find the optimal solution to the constrained problem in linear time. A statistical model, the planar hidden Markov model (PHMM), describing statistical properties of images is proposed. The PHMM approach was evaluated using a set of isolated handwritten digits. An overall digit recognition accuracy of 95% was achieved. It is expected that the advantage of this approach will be even more significant for harder tasks, such cursive-writing recognition and spotting
Keywords :
hidden Markov models; optical character recognition; DTW; OCR; PHMM; automatic speech recognition; computational complexity; constrained problem; digit recognition accuracy; distortions; dynamic plane warping; dynamic time warping; handwritten digits; image alignment; linear time; optical character recognition; optimality principle; planar hidden Markov model; statistical model; statistical properties; Automatic speech recognition; Character recognition; Computational complexity; Dynamic programming; Handwriting recognition; Hidden Markov models; Lattices; Optical character recognition software; Optical distortion; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.226254