DocumentCode :
2631522
Title :
Reading constrained text using hierarchical hidden Markov models
Author :
Lam, Stephen W. ; Hui, Wai-Kwan
Author_Institution :
Buffalo State Univ. of New York, NY, USA
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
151
Lastpage :
154
Abstract :
Text on documents can generally be classified into one of two categories, unconstrained and constrained, based on the spatial and contextual knowledge that is needed for comprehension. These constraints are found to be critical in text understanding. A model-based approach to reading constrained text is described. The system consists of two major components: a knowledge acquisition module and an image decoding module. Constraints on an application domain are encoded in a hierarchical stochastic model. Subparts of the hierarchy form hidden Markov models, which are used to determine decoding credibility. Verification of a terminal component of the hierarchy is performed by word recognition. Interpretation of a text block is by finding the highest credibility path in the hierarchy. The decoding process is application independent, as demonstrated in the experiments. The system has been tested on reading postal permits and other domains will be used in future tests
Keywords :
decoding; hidden Markov models; image coding; knowledge acquisition; constrained text; contextual knowledge; decoding credibility; decoding process; hidden Markov models; hierarchical stochastic model; highest credibility path; image decoding module; knowledge acquisition module; model-based approach; postal permits; reading; terminal component; text block; text understanding; word recognition; Decoding; Hidden Markov models; Image coding; Knowledge acquisition; Sociotechnical systems; Stochastic processes; System testing; Text analysis; Text recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395761
Filename :
395761
Link To Document :
بازگشت