DocumentCode
3811423
Title
Combinatorial optimization of distributed queries
Author
B. Groselj;Q.M. Malluhi
Author_Institution
BoyanTech Inc., McLean, VA, USA
Volume
7
Issue
6
fYear
1995
Firstpage
915
Lastpage
927
Abstract
In relational distributed databases a query cost consists of a local cost and a transmission cost. Query optimization is a combinatorial optimization problem. As the query size grows, the optimization methods based on exhaustive search become too expensive. We propose the following strategy for solving large distributed query optimization problems in relational database systems: (1) represent each query-processing schedule by a labeled directed graph; (2) reduce the number of different schedules by pruning away invalid or high-cost solutions; and (3) find a suboptimal schedule by combinatorial optimization. We investigate several combinatorial optimization techniques: random search, single start, multistart, simulated annealing, and a combination of random search and local simulated annealing. The utility of combinatorial optimization is demonstrated in the problem of finding the (sub)optimal semijoin schedule that fully reduces all relations of a tree query. The combination of random search and local simulated annealing was superior to other tested methods.
Keywords
"Simulated annealing","Costs","Optimization methods","Query processing","Processor scheduling","Relational databases","Database systems","Computer Society","Distributed databases","Testing"
Journal_Title
IEEE Transactions on Knowledge and Data Engineering
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.476497
Filename
476497
Link To Document