Title :
Reading constrained text using hierarchical hidden Markov models
Author :
Lam, Stephen W. ; Hui, Wai-Kwan
Author_Institution :
Buffalo State Univ. of New York, NY, USA
Abstract :
Text on documents can generally be classified into one of two categories, unconstrained and constrained, based on the spatial and contextual knowledge that is needed for comprehension. These constraints are found to be critical in text understanding. A model-based approach to reading constrained text is described. The system consists of two major components: a knowledge acquisition module and an image decoding module. Constraints on an application domain are encoded in a hierarchical stochastic model. Subparts of the hierarchy form hidden Markov models, which are used to determine decoding credibility. Verification of a terminal component of the hierarchy is performed by word recognition. Interpretation of a text block is by finding the highest credibility path in the hierarchy. The decoding process is application independent, as demonstrated in the experiments. The system has been tested on reading postal permits and other domains will be used in future tests
Keywords :
decoding; hidden Markov models; image coding; knowledge acquisition; constrained text; contextual knowledge; decoding credibility; decoding process; hidden Markov models; hierarchical stochastic model; highest credibility path; image decoding module; knowledge acquisition module; model-based approach; postal permits; reading; terminal component; text block; text understanding; word recognition; Decoding; Hidden Markov models; Image coding; Knowledge acquisition; Sociotechnical systems; Stochastic processes; System testing; Text analysis; Text recognition; Training data;
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
DOI :
10.1109/ICDAR.1993.395761