DocumentCode :
2031282
Title :
Connected and degraded text recognition using planar hidden Markov models
Author :
Agazzi, Oscar E. ; Kuo, Shyh-shiaw ; Levin, Esther ; Pieraccini, Roberto
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
Volume :
5
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
113
Abstract :
An algorithm for connected text recognition using enhanced planar hidden Markov models (PHMMs) is presented. The algorithm automatically segments text into characters (even if they are highly blurred and touching) as an integral part of the recognition process, thus jointly optimizing segmentation and recognition. Performance is enhanced by the use of state length models, transition probabilities among characters (bigrams), and grammars. Experiments are presented using: (1) a simulated database of over 24000 highly degraded images of city names and (2) a database of 6000 images rejected by a high-performance commercial OCR (optical character recognition) machine with 99.5% accuracy. Measured performance on the first database is 99.65% for the most degraded images when a grammar is used, and 98.76% in the second database. Traditional OCR algorithms would fail drastically on these images.<>
Keywords :
grammars; hidden Markov models; image segmentation; optical character recognition; connected text recognition; degraded text recognition; grammars; performance; planar hidden Markov models; segmentation; state length models; transition probabilities;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319760
Filename :
319760
Link To Document :
بازگشت