Title :
Keyword spotting for cursive document retrieval
Author :
Keaton, Patricia ; Greenspan, Hayit ; Goodman, Rodney
Author_Institution :
Dept. of Electr. Eng., California Inst. of Technol., Pasadena, CA, USA
Abstract :
We present one of the first attempts towards automatic retrieval of documents, in the noisy environment of unconstrained, multiple author handwritten forms. The documents were written in cursive script for which conventional OCR and text retrieval engines are not adequate. We focus on a visual word spotting indexing scheme for scanned documents housed in the Archives of the Indies in Seville, Spain. The framework presented utilizes pattern recognition, learning and information fusion methods, and is motivated from human word-spotting studies. The proposed system is described and initial results are presented
Keywords :
document image processing; handwriting recognition; indexing; optical character recognition; pattern recognition; query processing; visual databases; Archives of the Indies; OCR; cursive document retrieval; cursive script; human word-spotting studies; information fusion; keyword spotting; learning; multiple author handwritten forms; pattern recognition; scanned documents; text retrieval engines; visual word spotting indexing; Data mining; Databases; Engines; Humans; Image processing; Image segmentation; Indexing; Optical character recognition software; Pattern recognition; Working environment noise;
Conference_Titel :
Document Image Analysis, 1997. (DIA '97) Proceedings., Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-8186-8055-5
DOI :
10.1109/DIA.1997.627095