Title :
Finding Optimal Join Tree forMulti-Join Stream Queries in a Production System
Author :
Gomes, Joseph S. ; Choi, Heyong-Ah
Author_Institution :
George Washington University
Abstract :
Data Stream Management Systems (DSMS) handle a particular type of database applications that involve multiple continuous data streams with inputs arriving at highly variable and unpredictable rates. Since data rate fluctuates over time in this type of applications the appropriate join tree is crucial for maintaining high system throughput. We consider the problem of finding optimal join tree for performing count based sliding window multi-joins over continuous streams. We use a unit-time based cost model to evaluate the expected performance for a given join tree. We materialize all intermediate results assuming there is enough main memory to store all partial results and window buffers. We give a polynomial time algorithm that finds the optimal join tree under our cost model for a given noncommuting (single permutation) order of streams. This algorithm can be used in conjunction with any linear order producing heuristic to give the optimal tree for that order. Our algorithm is implemented in the Jess rule engine and an extensive experimental evaluation is provided.
Keywords :
Buffer storage; Cost function; Database systems; Engines; Monitoring; Polynomials; Production systems; Signal processing algorithms; Telecommunication traffic; Throughput;
Conference_Titel :
Distributed Computing Systems Workshops, 2006. ICDCS Workshops 2006. 26th IEEE International Conference on
Print_ISBN :
0-7695-2541-5
DOI :
10.1109/ICDCSW.2006.53