Title :
A generic recognition system for making archives documents accessible to public
Author :
Coüasnon, Bertrand ; Leplumey, Ivan
Author_Institution :
IRISA/INRIA, France
Abstract :
This paper presents annotations needed for handwritten archive document retrieval by content. We propose two complementary ways of producing those annotations: automatically by using optical document recognition and collectively by using the Internet and manual input by users. A platform for managing those annotations is presented as well as examples of automatic annotations on civil status registers, military forms (tested on 60,000 pages) and naturalization decrees, using a generic document recognition method. Examples of collective annotations built on automatic annotations are also given.
Keywords :
content-based retrieval; document image processing; image recognition; image segmentation; optical character recognition; DMOS method; Internet; civil status register; content based retrieval; description and modification of segmentation method; generic recognition system; handwritten archive document retrieval; military forms; optical document recognition; Automatic testing; Content based retrieval; Geometrical optics; Internet; Optical character recognition software; Text analysis; Text recognition;
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
DOI :
10.1109/ICDAR.2003.1227664