• DocumentCode
    3269883
  • Title
    A flexible infrastructure for gathering XML statistics and estimating query cardinality
  • Author
    Freire, Juliana; Ramanath, Maya; Zhang, Lingzhi
  • fYear
    2004
  • fDate
    30 March-2 April 2004
  • Firstpage
    857
  • Abstract
    A key component of XML data management systems is the result size estimator, which estimates the cardinalities of user queries. Estimated cardinalities are needed in a variety of tasks, including query optimization and cost-based storage design, and they can also give users early feedback about the expected outcome of their queries. In contrast to previously proposed result estimators, which use specialized data structures and estimation algorithms, StatiX uses histograms to uniformly capture both the structural and value skew present in documents. The original version of StatiX was built as a proof of concept. With the goal of making the system publicly available, we have built StatiX++, a new and improved version of StatiX, which extends the original system in significant ways. In this demonstration, we show the key features of StatiX++.
  • Keywords
    XML; data structures; query processing; statistical databases; StatiX++ system; XML data management systems; XML statistics; cost-based storage design; estimation algorithms; histograms; publicly available system; query cardinality estimation; query optimization; result size estimator; Statistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering, 2004. Proceedings. 20th International Conference on
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-2065-0
  • Type
    conf
  • DOI
    10.1109/ICDE.2004.1320085
  • Filename
    1320085