DocumentCode :
1461717
Title :
Supervised template estimation for document image decoding
Author :
Kopec, Gary E. ; Lomelin, Mauricio
Author_Institution :
Xerox Palo Alto Res. Center, CA, USA
Volume :
19
Issue :
12
fYear :
1997
fDate :
12/1/1997 12:00:00 AM
Firstpage :
1313
Lastpage :
1324
Abstract :
An approach to supervised training of character templates from page images and unaligned transcriptions is proposed. The template training problem is formulated as one of constrained maximum likelihood parameter estimation within the document image decoding framework. This leads to a three-phase iterative training algorithm consisting of transcription alignment, aligned template estimation (ATE), and channel estimation steps. The maximum likelihood ATE problem is shown to be NP-complete and, thus, an approximate solution approach is developed. An evaluation of the training procedure in a document-specific decoding task, using the University of Washington UW-II database of scanned technical journal articles, is described
Keywords :
Markov processes; character recognition; computational complexity; document image processing; image colour analysis; iterative methods; maximum likelihood decoding; maximum likelihood estimation; visual databases; NP-complete problem; UW-II database; University of Washington; aligned template estimation; channel estimation; character templates; constrained maximum likelihood parameter estimation; document image decoding; supervised template estimation; supervised training; technical journal articles; template training problem; three-phase iterative training algorithm; transcription alignment; unaligned transcriptions; Character recognition; Hidden Markov models; Image recognition; Image segmentation; Iterative algorithms; Iterative decoding; Maximum likelihood decoding; Maximum likelihood estimation; Parameter estimation; Shape;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.643891
Filename :
643891
Link To Document :
بازگشت