Title :
Indexing of handwritten document images
Author :
Syeda-Mahmood, Tanveer
Author_Institution :
Xerox Webster Research Center, NY, USA
Abstract :
An important problem in the management of scanned handwritten document image collections is their indexing or retrieval based on word queries. This paper presents a method for fast localization of query words in handwritten images by adapting the principle of geometric hashing. Specifically, a method of location hashing is presented that uses consecutive features along curves to produce small image hash tables that also enable fast indexing. Handwriting variations are handled by assembling groups of word segments separated by inter-letter spacing, which is estimated automatically from sample pages written by an author. Results are presented that indicate the reduction in search, as well as the precision and recall, possible with location hashing of handwritten words.
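The hashing idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes that each curve is given as an ordered list of (x, y) feature points, that every three consecutive points form an affine basis, that the remaining points on the same curve are hashed by their quantized affine coordinates, and that each table entry records the image and the basis origin as the "location". The function names, the quantization step of 0.25, and the voting scheme are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def affine_coords(p, o, e1, e2):
    """Return (a, b) such that p = o + a*(e1 - o) + b*(e2 - o).

    Returns None when the basis triple is (nearly) collinear.
    """
    ux, uy = e1[0] - o[0], e1[1] - o[1]
    vx, vy = e2[0] - o[0], e2[1] - o[1]
    det = ux * vy - uy * vx
    if abs(det) < 1e-9:
        return None
    px, py = p[0] - o[0], p[1] - o[1]
    a = (px * vy - py * vx) / det
    b = (ux * py - uy * px) / det
    return a, b


def entries(curve, step=0.25):
    """Yield (quantized key, basis origin) for each consecutive basis triple."""
    for i in range(len(curve) - 2):
        o, e1, e2 = curve[i], curve[i + 1], curve[i + 2]
        for j, p in enumerate(curve):
            if i <= j <= i + 2:
                continue  # skip the three basis points themselves
            c = affine_coords(p, o, e1, e2)
            if c is not None:
                yield (round(c[0] / step), round(c[1] / step)), o


def build_table(images):
    """Build the hash table from a dict of image_id -> list of curves."""
    table = defaultdict(list)
    for image_id, curves in images.items():
        for curve in curves:
            for key, origin in entries(curve):
                table[key].append((image_id, origin))
    return table


def locate(table, query_curve):
    """Vote for (image_id, location) pairs whose entries match the query."""
    votes = defaultdict(int)
    for key, _ in entries(query_curve):
        for hit in table.get(key, ()):
            votes[hit] += 1
    return sorted(votes.items(), key=lambda kv: -kv[1])


if __name__ == "__main__":
    word = [(0, 0), (1, 0), (1, 1), (2, 1), (2, 3)]
    # An affinely distorted copy of the same word, stored as "page1".
    warped = [(2 * x + y + 10, x + 3 * y + 5) for x, y in word]
    table = build_table({"page1": [warped]})
    for hit, count in locate(table, word):
        print(hit, count)
```

Because affine coordinates are unchanged by an affine distortion of the whole curve, the undistorted query lands in the same buckets as its warped copy, and the votes concentrate on the basis origins of that copy. Restricting bases to consecutive points keeps the number of table entries roughly quadratic in the curve length, rather than growing with every possible point triple.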
Keywords :
computational geometry; document image processing; handwriting recognition; indexing; query processing; visual databases; curves; document retrieval; fast localization; geometric hashing; handwriting variations; handwritten document image collection; handwritten document image indexing; location hashing; precision; recall; scanned documents; search; small-sized image hash tables; word queries; word segments; Assembly; Environmental management; Handwriting recognition; Histograms; Image retrieval; Image segmentation; Indexing; Optical character recognition software; Robustness; Software libraries
Conference_Title :
Proceedings of the Workshop on Document Image Analysis (DIA '97), 1997
Conference_Location :
San Juan
Print_ISBN :
0-8186-8055-5
DOI :
10.1109/DIA.1997.627094