DocumentCode :
2458021
Title :
Evaluating Probabilistic Queries over Uncertain Matching
Author :
Cheng, Reynold ; Gong, Jian ; Cheung, David W. ; Cheng, Jiefeng
Author_Institution :
Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong, China
fYear :
2012
fDate :
1-5 April 2012
Firstpage :
1096
Lastpage :
1107
Abstract :
A matching between two database schemas, generated by machine learning techniques (e.g., COMA++), is often uncertain. Handling the uncertainty of schema matching has recently raised a lot of research interest, because the quality of applications rely on the matching result. We study query evaluation over an inexact schema matching, which is represented as a set of ``possible mappings´´, as well as the probabilities that they are correct. Since the number of possible mappings can be large, evaluating queries through these mappings can be expensive. By observing the fact that the possible mappings between two schemas often exhibit a high degree of overlap, we develop two efficient solutions. We also present a fast algorithm to compute answers with the k highest probabilities. An extensive evaluation on real schemas shows that our approaches improve the query performance by almost an order of magnitude.
Keywords :
database management systems; learning (artificial intelligence); probability; query processing; uncertainty handling; database schema matching; machine learning techniques; magnitude order; possible mappings; probabilistic query evaluation; query performance; uncertain matching; uncertainty handling; Aggregates; Algorithm design and analysis; Partitioning algorithms; Probabilistic logic; Query processing; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location :
Washington, DC
ISSN :
1063-6382
Print_ISBN :
978-1-4673-0042-1
Type :
conf
DOI :
10.1109/ICDE.2012.14
Filename :
6228159
Link To Document :
بازگشت