Title :
Nonparametric Bayesian Modeling for Automated Database Schema Matching
Author :
Erik Ferragut;Jason Laska
Abstract :
The problem of merging databases arises in many government and commercial applications. Schema matching, a common first step, identifies equivalent fields between databases. We introduce a schema matching framework that builds nonparametric Bayesian models for each field and compares them by computing the probability that a single model could have generated both fields. Our experiments show that our method is more accurate and faster than the existing instance-based matching algorithms in part because of the use of nonparametric Bayesian models.
Keywords :
"Computational modeling","Bayes methods","Government","Data integration","Databases","Metadata"
Conference_Titel :
Machine Learning and Applications (ICMLA), 2015 IEEE 14th International Conference on
DOI :
10.1109/ICMLA.2015.235