Title :
Copy-based versus edit-based version management schemes for structured documents
Author :
Chien, Shu-Yao ; Tsotras, Vassilis J. ; Zaniolo, Carlo
Author_Institution :
Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
Abstract :
Managing multiple versions of XML documents and semistructured data represents a problem of growing interest. Traditional version control methods, such as RCS, use edit scripts representing changes in the document to support the incremental reconstruction of different versions. The edit-based approaches have been enhanced with a replication scheme called UBCC (Chien et al., 2000). UBCC is based on the notion of page usefulness and ensures effective management for multi-version documents in terms of both retrieval and storage cost. These improvements notwithstanding, the edit-based representation suffers from limited generality and flexibility-e.g., it cannot represent changes such as rearranging the document or duplicating parts of its content. To solve these problems, the paper proposes a copy-based UBCC versioning scheme, which also provides a simpler format for the electronic exchange of multi-version documents. With the objective of matching the performance of the edit-based UBCC technique, we develop algorithms that enhance the copy-based UBCC scheme with page usefulness management. We also present results of various experiments that test the storage and retrieval performance of the new copy-based approach, and compare it with that of the edit-based UBCC approach
Keywords :
configuration management; document handling; hypermedia markup languages; software performance evaluation; UBCC; XML documents; copy-based version management; edit scripts; edit-based version management; experiments; multi-version documents; page usefulness management; replication scheme; retrieval performance; semistructured data; structured documents; Collaborative work; Computer science; Content management; Costs; History; Intrusion detection; Protection; Standards development; Testing; XML;
Conference_Titel :
Research Issues in Data Engineering, 2001. Proceedings. Eleventh International Workshop on
Conference_Location :
Heidelberg
Print_ISBN :
0-7695-0957-6
DOI :
10.1109/RIDE.2001.916496