DocumentCode
2314904
Title
Relational Operators in Heterogeneous Random Databases
Author
Velcescu, Letitia ; Vasile, Laurentiu
Author_Institution
Fac. of Math. & Inf., Univ. of Bucharest, Bucharest, Romania
fYear
2009
fDate
26-29 Sept. 2009
Firstpage
407
Lastpage
412
Abstract
In this paper, we investigate the result sizes of some approximate relational operations, focusing on join, outer join and difference. We extend the notion of a random database, in which the records are random vectors following a certain probability distribution, to heterogeneous random databases, in which each column can have its own unidimensional distribution. In this framework, we investigate whether the results already established for homogeneous databases remain true. Our approach follows three steps. First, we build histograms for some relational operations on heterogeneous tables with specific distributions; then we apply the chi-square test of goodness of fit; finally, we prove the result that sets the limits within which the cardinality of the self-join can be approximated by a Poisson distribution.
Keywords
Poisson distribution; distributed databases; relational databases; chi square test; goodness of fit; heterogeneous random databases; probability distribution; random vectors; relational operators; unidimensional distribution; Scientific computing; Random database; approximate relational operation; chi square test of goodness of fit; database optimization
fLanguage
English
Publisher
IEEE
Conference_Titel
Symbolic and Numeric Algorithms for Scientific Computing (SYNASC), 2009 11th International Symposium on
Conference_Location
Timisoara
Print_ISBN
978-1-4244-5910-0
Electronic_ISBN
978-1-4244-5911-7
Type
conf
DOI
10.1109/SYNASC.2009.50
Filename
5460821