• DocumentCode
    2314904
  • Title

    Relational Operators in Heterogeneous Random Databases

  • Author

    Velcescu, Letitia ; Vasile, Laurentiu

  • Author_Institution
    Fac. of Math. & Inf., Univ. of Bucharest, Bucharest, Romania
  • fYear
    2009
  • fDate
    26-29 Sept. 2009
  • Firstpage
    407
  • Lastpage
    412
  • Abstract
    In this paper, we investigate the sizes of some approximate relational operations results, focusing on join, outer join and difference. We extend the notion of random database, in which the records are random vectors following a certain probability distribution, to heterogeneous random databases, in which each column can have its own unidimensional distribution. In this framework, we will investigate if the results already existing for the homogeneous databases remain true. Our approach follows three steps. First, we build up the histograms for some relational operations on heterogeneous tables with specific distributions, then we apply the chi square test of goodness of fit and, in the end, we prove the result that sets the limits for which the cardinality of the self-join can be approximated by a Poisson distribution.
  • Keywords
    Poisson distribution; distributed databases; relational databases; Poisson distribution; chi square test; goodness of fit; heterogeneous random databases; probability distribution; random vectors; relational operators; unidimensional distribution; Relational databases; Scientific computing; Poisson distribution; Random database; approximate relational operation; chi square test of goodness of fit; database optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2009 11th International Symposium on
  • Conference_Location
    Timisoara
  • Print_ISBN
    978-1-4244-5910-0
  • Electronic_ISBN
    978-1-4244-5911-7
  • Type

    conf

  • DOI
    10.1109/SYNASC.2009.50
  • Filename
    5460821