• DocumentCode
    2507136
  • Title
    PXML: a probabilistic semistructured data model and algebra
  • Author
    Hung, Edward; Getoor, Lise; Subrahmanian, V.S.
  • Author_Institution
    Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
  • fYear
    2003
  • fDate
    5-8 March 2003
  • Firstpage
    467
  • Lastpage
    478
  • Abstract
    Despite the recent proliferation of work on semistructured data models, there has been little work to date on supporting uncertainty in these models. We propose a model for probabilistic semistructured data (PSD). The advantage of our approach is that it supports a flexible representation that allows the specification of a wide class of distributions over semistructured instances. We provide two semantics for the model and show that the semantics are probabilistically coherent. Next, we develop an extension of the relational algebra to handle probabilistic semistructured data and describe efficient algorithms for answering queries that use this algebra. Finally, we present experimental results showing the efficiency of our algorithms.
  • Keywords
    XML; data models; query processing; relational algebra; relational databases; tree data structures; PXML; probabilistic semistructured data model; semistructured instances; Algebra; Data engineering
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 2003. Proceedings. 19th International Conference on
  • Print_ISBN
    0-7803-7665-X
  • Type
    conf
  • DOI
    10.1109/ICDE.2003.1260814
  • Filename
    1260814