Title :
Automatic generation of mediated schemas through reasoning over data dependencies
Author :
Li, Xiang ; Quix, Christoph ; Kensche, David ; Geisler, Sandra ; Guo, Lisong
Author_Institution :
Inf. 5 (Databases & Inf. Syst.), RWTH Aachen Univ., Aachen, Germany
Abstract :
Mediated schemas lie at the center of the well recognized data integration architecture. Classical data integration systems rely on a mediated schema created by human experts through an intensive design process. Automatic generation of mediated schemas is still a goal to be achieved. We generate mediated schemas by merging multiple source schemas interrelated by tuple-generating dependencies (tgds). Schema merging is the process to consolidate multiple schemas into a unified view. The task becomes particularly challenging when the schemas are highly heterogeneous and autonomous. Existing approaches fall short in various aspects, such as restricted expressiveness of input mappings, lacking data level interpretation, the output mapping is not in a logical language (or not given at all), and being confined to binary merging. We present here a novel system which is able to perform native n-ary schema merging using P2P style tgds as input. Suited in the scenario of generating mediated schemas for data integration, the system opts for a minimal schema signature retaining all certain answers of conjunctive queries. Logical output mappings are generated to support the mediated schemas, which enable query answering and, in some cases, query rewriting.
Keywords :
data handling; peer-to-peer computing; query processing; question answering (information retrieval); P2P style; conjunctive queries; data dependencies; data integration architecture; input mapping restricted expressiveness; lacking data level interpretation; logical output mappings; mediated schema automatic generation; multiple source schema merging; n-ary schema merging; query answering; query rewriting; tuple-generating dependencies; Cognition; Engines; Joints; Merging; Query processing; Semantics;
Conference_Titel :
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-8959-6
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2011.5767913