DocumentCode
3038874
Title
XEM: managing the evolution of XML documents
Author
Su, Hong ; Kramer, Diane ; Chen, Li ; Claypool, Kajal ; Rundensteiner, Elke A.
Author_Institution
Dept. of Comput. Sci., Worcester Polytech. Inst., MA, USA
fYear
2001
fDate
2001
Firstpage
103
Lastpage
110
Abstract
As information on the World Wide Web continues to proliferate at an astounding rate, the Extensible Markup Language (XML) has been emerging as a standard format for data representation on the Web. In many applications, specific document type definitions (DTDs) are designed to enforce a semantically agreed-upon structure of the XML documents for management. However, both the data and the structure of XML documents tend to change over time for a multitude of reasons, including to correct design errors in the DTD, to allow expansion of the application scope over time, or to account for the merging of several businesses into one. However most of the current software tools that enable the use of XML do not provide explicit support for such data or schema changes. In this vein, we put forth the first solution framework, called XML Evolution Manager (XEM) to manage the evolution of XML. XEM provides a minimal yet complete taxonomy of basic change primitives. These primitives, classified as either data changes or schema changes, are consistency-preserving, i.e., for a data change, they ensure that the modified XML document conforms to its DTD both in structure and constraints; and for a schema change, they ensure that the new DTD is a valid DTD and all existing XML documents are transformed also to conform to the modified DTD. We prove the completeness of the taxonomy in terms of DTD transformation. To verify the feasibility of our XEM approach we have implemented a working prototype system using PSE Pro as our backend storage system
Keywords
Internet; data models; document handling; hypermedia markup languages; information resources; Extensible Markup Language; PSE Pro; World Wide Web; XEM; XML Evolution Manager; XML document evolution management; backend storage system; data change; data model; data representation; document type definitions; prototype system; schema change; Application software; Computer science; Error correction; Merging; Prototypes; Software tools; Taxonomy; Veins; Web sites; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Research Issues in Data Engineering, 2001. Proceedings. Eleventh International Workshop on
Conference_Location
Heidelberg
Print_ISBN
0-7695-0957-6
Type
conf
DOI
10.1109/RIDE.2001.916497
Filename
916497
Link To Document