DocumentCode :
2732478
Title :
QPIAD: Query Processing over Incomplete Autonomous Databases
Author :
Khatri, H. ; Jianchun Fan ; Yi Chen ; Kambhampati, S.
Author_Institution :
Dept. of Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA
fYear :
2007
fDate :
15-20 April 2007
Firstpage :
1430
Lastpage :
1432
Abstract :
Incompleteness due to missing attribute values (aka "null values") is very common in autonomous Web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like the mediator to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. However, the autonomous nature of the databases poses several challenges, such as the restricted access privileges, limited query patterns, and sensitivity of database and network resource consumption in the Web environment. We introduce a novel query rewriting and optimization framework QPIAD that tackles these challenges to retrieve relevant uncertain answers. Our technique involves reformulating the user query based on approximate functional dependencies (AFDs) among the database attributes and ranking these queries using value distributions learned from naive Bayes classifiers. Empirical studies demonstrate the effectiveness of our approach in retrieving relevant uncertain answers with high precision, high recall and manageable cost.
Keywords :
Bayes methods; Internet; database management systems; pattern classification; query processing; answer tuples; approximate functional dependencies; autonomous Web databases; incomplete autonomous databases; information retrieval; naive Bayes classifiers; null values; query optimization; query processing; query rewriting; Acoustical engineering; Computer science; Costs; Creep; Data engineering; Databases; Information retrieval; Null value; Query processing; Web server; autonomous databases; incomplete databases; query rewriting; querying hidden web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on
Conference_Location :
Istanbul
Print_ISBN :
1-4244-0802-4
Type :
conf
DOI :
10.1109/ICDE.2007.369028
Filename :
4221818
Link To Document :
بازگشت