DocumentCode
1499240
Title
Statistical relational databases: normal forms
Author
Ghosh, Sakti P.
Author_Institution
IBM Almaden Res. Center, San Jose, CA, USA
Volume
3
Issue
1
fYear
1991
fDate
3/1/1991 12:00:00 AM
Firstpage
55
Lastpage
64
Abstract
Problems associated with defining normal forms of relational tables relevant to statistical processing are discussed. The concepts of derived identifier, class identifier, derived class-counts, count domains, compact domains, and uniform domains for statistical relational tables are introduced. The structures of the first and the second statistical-normal forms and the relational decompositions needed to achieve them are also discussed. It is shown that the statistical-normal form can be an important method to determine whether the usual statistical analysis techniques are valid. Some suggestions are presented for extending the structured query language (SQL) statements to achieve these operations on statistical relational tables. Some results linking Codd´s normal forms with statistical normal forms are discussed. Relational statistical abnormalities, called outlyers, are also discussed
Keywords
query languages; relational databases; SQL; class identifier; compact domains; count domains; derived class-counts; derived identifier; normal forms; outlyers; relational decompositions; relational tables; statistical abnormalities; statistical analysis; statistical relational databases; structured query language; uniform domains; Aggregates; Algebra; Data compression; Data security; Data structures; Joining processes; Query processing; Relational databases; Statistical analysis;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.75889
Filename
75889
Link To Document