• DocumentCode
    2193126
  • Title
    Access Support Tree and TextArray: a data structure for XML document storage and retrieval
  • Author
    Scheffner, Dieter ; Freytag, Johann-Christoph
  • Author_Institution
    Dept. of Comput. Sci., Humboldt-Univ., Berlin, Germany
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    155
  • Lastpage
    164
  • Abstract
    The characteristics of XML documents require new ways of storing and querying such documents. Queries on both textual content and structural aspects must be supported efficiently. For this reason, we examined existing work on both document storage approaches and models for querying documents to derive requirements that are essential for the storage of XML documents. As a result of our study, we designed the Access Support Tree and TextArray (AST/TA) data structure. The key idea of the AST/TA data structure is the separation of the (logical) structure of a document from its "visible" text content. The latter is represented as a single contiguous string. At the same time, the AST/TA data structure provides a tight integration of the two to guarantee consistent changes. We introduce the AST/TA data structure formally through its abstraction, the AST/TA model, and compare the requirements of our AST/TA approach with those found in the current literature. Finally, we describe the advantages of the AST/TA model based on the AST/TA design principles.
  • Keywords
    hypermedia markup languages; multimedia databases; natural sciences computing; query processing; tree data structures; Access Support Tree and TextArray data structure; XML document retrieval; XML document storage; contiguous string; structural aspects; textual content; Computer science; Content based retrieval; Data structures; Database languages; Information retrieval; Internet; Merging; Search engines; Tree data structures; XML
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Scientific and Statistical Database Management, 2002. Proceedings. 14th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-1632-7
  • Type
    conf
  • DOI
    10.1109/SSDM.2002.1029716
  • Filename
    1029716
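
The separation described in the abstract (document structure kept apart from the visible text, which is held as a single contiguous string) can be illustrated with a minimal sketch. This is not the authors' AST/TA definition: the names `Node`, `Document`, `content`, `innermost` and `insert`, the use of half-open character offsets, and the boundary rule for insertions are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical structural node: element name plus a half-open span [start, end)."""
    name: str
    start: int
    end: int
    children: List["Node"] = field(default_factory=list)


@dataclass
class Document:
    text: str   # all visible text as one contiguous string
    root: Node  # logical structure, stored separately from the text

    def content(self, node: Node) -> str:
        """Return the visible text covered by a structural node."""
        return self.text[node.start:node.end]

    def innermost(self, pos: int) -> Node:
        """Return the deepest node whose span covers text position pos."""
        node = self.root
        while True:
            for child in node.children:
                if child.start <= pos < child.end:
                    node = child
                    break
            else:
                return node

    def insert(self, pos: int, s: str) -> None:
        """Insert s at pos and shift spans so text and structure stay consistent.

        Assumption of this sketch: an insertion exactly on a span boundary
        is treated as falling outside that span.
        """
        self.text = self.text[:pos] + s + self.text[pos:]
        stack = [self.root]
        while stack:
            node = stack.pop()
            if node.start >= pos:
                node.start += len(s)
            if node.end > pos:
                node.end += len(s)
            stack.extend(node.children)


# Hypothetical document: <article><title>AST/TA</title><body>stores XML</body></article>
title = Node("title", 0, 6)
body = Node("body", 6, 16)
doc = Document("AST/TAstores XML", Node("article", 0, 16, [title, body]))
```

A content query then becomes a plain string search followed by a structural lookup, for example `doc.innermost(doc.text.find("XML"))`, while `doc.insert(...)` changes the text and the spans in one step so the two cannot drift apart.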