• DocumentCode
    2428857
  • Title

    Automatic aggregation using explicit metadata

  • Author

    Grumbach, Stéphane ; Tininini, Leonardo

  • Author_Institution
    Inst. Nat. de Recherche en Inf. et Autom., Le Chesnay, France
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    85
  • Lastpage
    94
  • Abstract
    The paper presents a logical data model for statistical data with an explicit modeling of metadata, which allows to perform automatic aggregation. The data are stored in standard relations from the relational model, while the metadata, defining the semantics of the relations, are represented by numerical dependencies which specify the way the summary values are defined in terms of micro-data, as well as the interrelationships among summary values. The present model supports standard relational languages such as SQL. Relations with numerical dependencies are then seen as statistical views over initial relations of micro-data. Queries can be asked either against the views or directly against the initial relations, and in this later case answered, when possible, using the views. The numerical dependencies of the views are run together with the query to compute the answer to the query. This is handled in a completely automatic manner with no need for the user to deal with the intricacy of metadata. The mechanism has been tested by an implementation in Prolog of meaningful examples of queries and dependencies. It is shown in particular that various classical problems in the realm of statistical and multidimensional databases can be easily modeled and solved in the present framework. Finally, the proposed formalism is shown to be useful for statistical database schema design
  • Keywords
    PROLOG; relational databases; statistical databases; Prolog; `; automatic aggregation; explicit metadata; explicit modeling; logical data model; multidimensional databases; numerical dependencies; relational model; standard relational languages; statistical data; statistical database; Aggregates; Data mining; Data models; Data privacy; Database systems; Multidimensional systems; Protection; Statistical analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2000. Proceedings. 12th International Conference on
  • Conference_Location
    Berlin
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-0686-0
  • Type

    conf

  • DOI
    10.1109/SSDM.2000.869780
  • Filename
    869780