DocumentCode :
1856718
Title :
Entity matching in heterogeneous databases: a distance-based decision model
Author :
Dey, Debabrata ; Sarkar, Sumit ; De, Pradipta
Author_Institution :
Washington Univ., USA
Volume :
7
fYear :
1998
fDate :
6-9 Jan 1998
Firstpage :
305
Abstract :
The need to leverage the information contained in heterogeneous data sources has been widely documented. In order to accomplish this goal, an organization must resolve several types of heterogeneity problems that may exist across different data sources. We investigate one such problem called the entity heterogeneity problem. This problem arises when the same real-world entity type is represented using different identifiers in different applications. We propose a decision-theoretic model to resolve the problem. Our model uses a distance-based measure to express the similarity between two entity instances. We have implemented the model, and our experimental results indicate that this is a viable approach in real-world situations.
Keywords :
business data processing; database theory; decision theory; distributed databases; entity-relationship modelling; relational databases; data sources; decision theoretic model; distance-based decision model; entity heterogeneity problem; entity matching; entity relationship model; heterogeneous databases; identifiers; organization; relational database; Application software; Data analysis; Data structures; Databases; Information analysis; Investments; Management information systems; Protocols; Standards organizations; System testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
System Sciences, 1998., Proceedings of the Thirty-First Hawaii International Conference on
Conference_Location :
Kohala Coast, HI
Print_ISBN :
0-8186-8255-8
Type :
conf
DOI :
10.1109/HICSS.1998.649225
Filename :
649225