• DocumentCode
    1723834
  • Title

    Algebraic operator support for semantic data fusion in extended SQL

  • Author

    Hosain, Shazzad ; Jamil, Hasan

  • fYear
    2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    One of the basic operations required to gather more information about an object is called information aggregation or data fusion. The process requires recognition of a semantic object and gathering the new information into the collection that already exists for that object. Another related operation is collecting a set of distinct semantic objects that are similar. These operations become complicated in the presence of schema and extent heterogeneity and semantic similarity. Although a rich body of research addressed these issues in the literature, a database language support is yet available possibly because an algebraic formulation of these concepts was absent. An algebraic characterization is needed for query plan generation, optimization and query processing. In this paper, we propose two new binary operators called link (λ) and combine (χ) that capture the spirit of vertical and horizontal data fusion. The proposed operators leverage the development in schema matching and key identification technologies by casting them as user selectable functions μ and κ. We show that link and combine are generalized versions of traditional join and union operations. We also propose two extensions of SQL that exploits these two operators and opens up many optimization possibilities. We also point out that link and combine are also useful for semantic data integration and are currently being used in LifeDB data management system for Life Sciences applications.
  • Keywords
    SQL; algebra; database management systems; natural sciences computing; optimisation; query processing; sensor fusion; LifeDB data management system; algebraic operator support; extended SQL; information aggregation; life sciences applications; optimization; query plan generation; query processing; semantic data fusion; semantic object recognition; Couplings; Database languages; Databases; Google; Object recognition; Optimization; Semantics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cybernetic Intelligent Systems (CIS), 2010 IEEE 9th International Conference on
  • Conference_Location
    Reading
  • Print_ISBN
    978-1-4244-9023-3
  • Electronic_ISBN
    978-1-4244-9024-0
  • Type

    conf

  • DOI
    10.1109/UKRICIS.2010.5898129
  • Filename
    5898129