Title :
Data integration by describing sources with constraint databases
Author :
Cheng, Xun ; Dong, Guozhu ; Lau, Tzekwan ; Su, Jianwen
Author_Institution :
Dept. of Comput. Sci., California Univ., Santa Barbara, CA, USA
Abstract :
We develop a data integration approach for the efficient evaluation of queries over autonomous source databases. The approach is based on some novel applications and extensions of constraint database techniques. We assume the existence of a global database schema. The contents of each data source are described using a set of constraint tuples over the global schema; each such tuple indicates possible contributions from the source. The “source description catalog” (SDC) of a global relation consists of its associated constraint tuples. Such a method of description is advantageous since it is flexible to add new sources and to modify existing ones. In our framework, to evaluate a conjunctive query over the global schema, a plan generator first identifies relevant data sources by “evaluating” the query against the SDCs using techniques of constraint query evaluation; it then formulates an evaluation plan, consisting of some specialized queries over different paths. The evaluation of a query associated with a path is done by a sequence of partial evaluations at data sources along the path, similar to sideways information passing of Datalog; the partially evaluated queries travel along their associated paths. Our SDC based query planning is efficient since it avoids the NP-complete query rewriting process. We can achieve further optimization using techniques such as emptiness test
Keywords :
constraint handling; deductive databases; query processing; Datalog; NP-complete query rewriting process; SDC based query planning; autonomous source databases; conjunctive query; constraint databases; constraint query evaluation; constraint tuples; data integration approach; data source; data sources; emptiness test; evaluation plan; global database schema; global relation; global schema; partial evaluations; partially evaluated queries; plan generator; sideways information passing; source description catalog; specialized queries; Application software; Computer science; Data engineering; Data warehouses; Databases; Process planning; Query processing; Software libraries; Testing; Warehousing;
Conference_Titel :
Data Engineering, 1999. Proceedings., 15th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-0071-4
DOI :
10.1109/ICDE.1999.754953