DocumentCode
3264184
Title
Where´s Charlie: family based heuristics for peer-to-peer schema integration
Author
Tranier, John ; Baraër, Renaud ; Bellahsène, Zohra ; Teisseire, Maguelome
Author_Institution
LIRMM UMR, Univ. Montpellier II, France
fYear
2004
fDate
7-9 July 2004
Firstpage
227
Lastpage
235
Abstract
Querying semantically related data sources depends on the ability to map between their schemas. Unfortunately, in most cases matching between schemas is still largely performed manually. As a consequence, semantic integration issues have become a key bottleneck in the deployment of a large scale integration systems (the number of schemas to map is huge). This work deals with automated methods for matching and efficiently generating schema mappings in large scale environments. We propose a level-wise algorithm based on a semantic distance to evaluate similarity between schema nodes. However, in a large scale context, computing the semantic distance for every couple of nodes cannot be done. Thus, we proposed family based heuristics (CHARLIE) in order to efficiently generate mappings. Experiments have shown that our approach is very efficient for large scale integration especially in a super peer based architecture and that it is relevant for real datasets.
Keywords
peer-to-peer computing; query processing; CHARLIE; family based heuristics; large scale integration systems; level-wise algorithm; peer-to-peer schema integration; schema matching; schema node similarity; semantic distance; semantic querying; super peer based architecture; Peer to peer computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
ISSN
1098-8068
Print_ISBN
0-7695-2168-1
Type
conf
DOI
10.1109/IDEAS.2004.1319795
Filename
1319795
Link To Document