DocumentCode :
2220763
Title :
Transcript mapping for historic handwritten document images
Author :
Tomai, Catalin I. ; Zhang, Bin ; Govindaraju, Venu
Author_Institution :
UB Commons, CEDAR, Amherst, NY, USA
fYear :
2002
fDate :
2002
Firstpage :
413
Lastpage :
418
Abstract :
There is a large number of scanned historical documents that need to be indexed for archival and retrieval purposes. A visual word spotting scheme that would serve these purposes is a challenging task even when the transcription of the document image is available. We propose a framework for mapping each word in the transcript to the associated word image in the document. Coarse word mapping based on document constraints is used for lexicon reduction. Then, word mappings are refined using word recognition results by a dynamic programming algorithm that finds the best match while satisfying the constraints.
Keywords :
document image processing; handwritten character recognition; humanities; image segmentation; indexing; dynamic programming; historic handwritten document images; indexing; lexicon reduction; visual word spotting; word recognition mapping; word segmentation; Availability; Dynamic programming; Heuristic algorithms; Image retrieval; Image segmentation; Indexing; Optical character recognition software; Prototypes; Smoothing methods; Venus;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition, 2002. Proceedings. Eighth International Workshop on
Print_ISBN :
0-7695-1692-0
Type :
conf
DOI :
10.1109/IWFHR.2002.1030945
Filename :
1030945
Link To Document :
بازگشت