Title :
Document image analysis for World War II personal records
Author :
Antonacopoulos, A. ; Karatzas, D.
Author_Institution :
Dept. of Comput. Sci., Liverpool Univ., UK
Abstract :
Complete collections of invaluable documents of unique historical and political significance are decaying and, at the same time, remain virtually inaccessible, necessitating the invention of robust and efficient methods for their conversion into a searchable electronic form. We present the issues encountered and problems addressed in the MEMORIAL project, whose goal is the establishment of a digital document workbench enabling the creation of distributed virtual archives based on documents existing in libraries, archives, museums, memorials, and public record offices. Successful approaches are described in the context of the chosen data class: a variety of typewritten documents containing personal information relating to the presence of individuals in World War II Nazi concentration camps.
Keywords :
character recognition; digital libraries; document image processing; history; MEMORIAL project; World War II Nazi concentration camps; World War II personal records; digital document workbench; distributed virtual archives; document image analysis; typewritten documents; Computer science; Europe; Government; History; Image analysis; Ink; Pattern recognition; Robustness; Software libraries; Text analysis
Conference_Title :
Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
Print_ISBN :
0-7695-2088-X
DOI :
10.1109/DIAL.2004.1263263