• DocumentCode
    2699157
  • Title

    A Mapping Scheme of XML Documents into Relational Databases using Schema-based Path Identifiers

  • Author

    Fujimoto, Kenji ; Shimizu, Tsuyoshi ; Dao Dinh Kha ; Yoshikawa, Masatoshi ; Amagasa, Toshiyuki

  • Author_Institution
    Graduate Sch. of Inf. Sci., Nagoya Univ.
  • fYear
    2005
  • fDate
    8-9 April 2005
  • Firstpage
    82
  • Lastpage
    90
  • Abstract
    In this paper, we propose a mapping scheme of XML documents into relational databases. The scheme enables us to store, retrieve and update XML documents efficiently. When storing XML documents in relational databases, XML tree structures must be preserved explicitly. To this end, a label is assigned to each node in the XML tree. In general, document retrieval and update performance is affected by the node labeling scheme. We use SPIDER (Schema-based Path IDentifiER), a labeling scheme for XML documents utilizing DTDs that makes retrieval and update more efficient. SPIDER identifies only the path from the root node to a node. Thus, multiple nodes appearing on the same path cannot be distinguished using SPIDER alone. We introduce Sibling Dewey Order to identify such nodes. Generally, when a new node is inserted into an XML document, some other nodes need to be relabeled to preserve the order of nodes. In our method, only Sibling Dewey Order is relabeled; SPIDER is not affected. Since the range of relabeling is small, it is possible to update documents efficiently. We store documents labeled with SPIDER in a relational database and translate various XPath expressions into SQL using SPIDER. We perform experiments and demonstrate that the proposed scheme outperforms conventional methods in both retrieval and update.
  • Keywords
    SQL; XML; document handling; information retrieval; relational databases; tree data structures; SPIDER; SQL; Sibling Dewey Order; XML document mapping; XML tree structures; XPath expressions; document retrieval; document type definition; node labeling schemes; relational databases; schema based path identifier; schema-based path identifiers; update performance; Data models; Information retrieval; Information science; Information technology; Labeling; Object oriented databases; Object oriented modeling; Relational databases; Tree data structures; XML;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
  • Conference_Location
    Tokyo
  • Print_ISBN
    0-7695-2414-1
  • Type

    conf

  • DOI
    10.1109/WIRI.2005.4
  • Filename
    1552999
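
The labeling idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how SPIDER values are computed from a DTD, so this sketch simply numbers each distinct root-to-node path in the order it is first seen in the document, and it interprets Sibling Dewey Order as the sequence of positions among same-tag siblings along the path. Both choices, and the names `label_nodes`, `path_ids` and `rows`, are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def label_nodes(root):
    """Label every element with (path_id, sibling_order, tag).

    path_id: integer standing in for a SPIDER value; one per distinct
        root-to-node path (assumption: numbered by first appearance,
        not derived from a DTD as in the paper).
    sibling_order: tuple of positions among same-tag siblings at each
        level (assumed reading of Sibling Dewey Order).
    """
    path_ids = {}
    rows = []

    def visit(elem, parent_path, order):
        path = parent_path + "/" + elem.tag
        # Reuse the id if this path was seen before, else assign the next one.
        pid = path_ids.setdefault(path, len(path_ids) + 1)
        rows.append((pid, order, elem.tag))
        counters = {}
        for child in elem:
            counters[child.tag] = counters.get(child.tag, 0) + 1
            visit(child, path, order + (counters[child.tag],))

    visit(root, "", (1,))
    return path_ids, rows


doc = "<book><chapter><title/></chapter><chapter><title/><title/></chapter></book>"
root = ET.fromstring(doc)
path_ids, rows = label_nodes(root)

# Insert a new, empty chapter in front of the existing ones and relabel.
root.insert(0, ET.Element("chapter"))
path_ids_after, rows_after = label_nodes(root)
```

In this sketch the two `chapter` elements share one path id and are told apart only by their sibling order. After the insertion, the path-id table is unchanged while the sibling orders of the following chapters shift, which mirrors the abstract's point that only Sibling Dewey Order needs relabeling. Each row could be stored as a tuple in a relational table, though the paper's actual table layout is not given in this record.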