• DocumentCode
    1499240
  • Title

    Statistical relational databases: normal forms

  • Author

    Ghosh, Sakti P.

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    3
  • Issue
    1
  • fYear
    1991
  • fDate
    3/1/1991 12:00:00 AM
  • Firstpage
    55
  • Lastpage
    64
  • Abstract
    Problems associated with defining normal forms of relational tables relevant to statistical processing are discussed. The concepts of derived identifier, class identifier, derived class-counts, count domains, compact domains, and uniform domains for statistical relational tables are introduced. The structures of the first and the second statistical-normal forms and the relational decompositions needed to achieve them are also discussed. It is shown that the statistical-normal form can be an important method to determine whether the usual statistical analysis techniques are valid. Some suggestions are presented for extending the structured query language (SQL) statements to achieve these operations on statistical relational tables. Some results linking Codd´s normal forms with statistical normal forms are discussed. Relational statistical abnormalities, called outlyers, are also discussed
  • Keywords
    query languages; relational databases; SQL; class identifier; compact domains; count domains; derived class-counts; derived identifier; normal forms; outlyers; relational decompositions; relational tables; statistical abnormalities; statistical analysis; statistical relational databases; structured query language; uniform domains; Aggregates; Algebra; Data compression; Data security; Data structures; Joining processes; Query processing; Relational databases; Statistical analysis;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.75889
  • Filename
    75889