Title :
An experimental HMM-based postal OCR system
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
It is almost universally accepted in speech recognition that phone- or word-level segmentation prior to recognition is neither feasible nor desirable, and in the dynamic (pen-based) handwriting recognition domain the success of segmentation-free techniques points to the same conclusion. But in image-based handwriting recognition, this conclusion is far from being firmly established, and the results presented in this paper show that systems employing character-level presegmentation can be more effective, even within the same HMM paradigm, than systems relying on sliding window feature extraction. We describe two variants of a hidden Markov system recognizing handwritten addresses on US mail, one with presegmentation and one without, and report results on the CEDAR data set
Keywords :
feature extraction; hidden Markov models; image segmentation; optical character recognition; postal services; CEDAR data set; US mail; character-level presegmentation; experimental HMM-based postal OCR system; handwritten addresses; hidden Markov model; image-based handwriting recognition; segmentation-free techniques; sliding window feature extraction; Character recognition; Data mining; Delay effects; Feature extraction; Handwriting recognition; Hidden Markov models; Image segmentation; Optical character recognition software; Postal services; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.595467