Title :
Optimizing large join queries using a graph-based approach
Author :
Lee, Chiang ; Shih, Chi-Sheng ; Chen, Yaw-Huei
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
Although many query tree optimization strategies have been proposed in the literature, there still is a lack of a formal and complete representation of all possible permutations of query operations (i.e., execution plans) in a uniform manner. A graph-theoretic approach presented in the paper provides a sound mathematical basis for representing a query and searching for an execution plan. In this graph model, a node represents an operation and a directed edge between two nodes indicates the older of executing these two operations in an execution plan. Each node is associated with a weight and so is an edge. The weight is an expression containing optimization required parameters, such as relation size, tuple size, join selectivity factors. All possible execution plans are representable in this graph and each spanning tree of the graph becomes an execution plan. It is a general model which can be used in the optimizer of a DBMS for internal query representation. On the basis of this model, we devise an algorithm that finds a near optimal execution plan using only polynomial time. The algorithm is compared with a few other popular optimization methods. Experiments show that the proposed algorithm is superior to the others under most circumstances
Keywords :
graph theory; query processing; relational algebra; search problems; trees (mathematics); directed edge; execution plan; execution plans; general model; graph based approach; graph model; graph-theoretic approach; internal query representation; join selectivity factors; large join query optimization; mathematical basis; near optimal execution plan; polynomial time; popular optimization methods; query operations; query tree optimization strategies; relation size; spanning tree; tuple size; uniform manner; Cost function; Database systems; Graph theory; Helium; Iterative algorithms; Optimization methods; Polynomials; Query processing; Tree graphs; Upper bound;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on