Title :
Optimizing Multiple Distributed Stream Queries Using Hierarchical Network Partitions
Author :
Seshadri, Sangeetha ; Kumar, Vibhore ; Cooper, Brian F. ; Liu, Ling
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA
Abstract :
We consider the problem of query optimization in distributed data stream systems where multiple continuous queries may be executing simultaneously. To achieve the best performance, query planning (such as join ordering) must be considered in conjunction with deployment planning (e.g., assigning operators to physical nodes with optimal ordering). However, such a combination involves not only a large number of network nodes but also many query operators, resulting in an extremely large search space for optimal solutions. Our paper addresses this problem by utilizing hierarchical network partitions. We propose two algorithms, top-down and bottom-up, which utilize hierarchical network partitions to provide scalable query optimization. Formal analysis is presented to establish the bounds on the search space and to show the sub-optimality of our algorithms. Through simulations and experiments using a prototype deployed on Emulab, we demonstrate the effectiveness of our algorithms.
Keywords :
distributed databases; query processing; Emulab; distributed data stream system; hierarchical network partition; multiple distributed stream query optimization; Algorithm design and analysis; Computer networks; Costs; Distributed computing; Educational institutions; Network topology; Partitioning algorithms; Query processing; Runtime; Virtual prototyping
Conference_Title :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
DOI :
10.1109/IPDPS.2007.370281