Title :
XML Processing in the Cloud: Large-Scale Digital Preservation in Small Institutions
Author :
Wittek, Peter ; Jacquin, Thierry ; Déjean, Hervé ; Chanod, Jean-Pierre ; Daranyi, S.
Author_Institution :
Swedish Sch. of Libr. & Inf. Sci., Univ. of Boras, Boras, Sweden
Abstract :
Digital preservation deals with the problem of retaining the meaning of digital information over time to ensure its accessibility. The process often involves a workflow which transforms the digital objects. The workflow defines document pipelines containing transformations and validation checkpoints, either to facilitate migration for persistent archival or to extract metadata. The transformations, nevertheless, are computationally expensive, and therefore digital preservation can be out of reach for an organization whose core operation is not in data conservation. The operations described in the document workflow, however, do not recur frequently. This paper combines an implementation-independent workflow designer with cloud computing to support small institutions in the ad-hoc peak computing needs that stem from their efforts in digital preservation.
Keywords :
XML; cloud computing; document handling; meta data; workflow management software; XML processing; ad-hoc peak computing; cloud computing; data conservation; digital preservation; document pipelines; document workflow; implementation-independent workflow designer; metadata archival; metadata extraction; validation checkpoint; Cloud computing; Complexity theory; Context; Organizations; Pipelines; Production; XML;
Conference_Title :
Parallel and Distributed Processing Workshops and PhD Forum (IPDPSW), 2011 IEEE International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-425-1
Electronic_ISBN :
1530-2075
DOI :
10.1109/IPDPS.2011.253