Title :
Adaptive Load Diffusion for Multiway Windowed Stream Joins
Author :
Xiaohui Gu ; Yu, Philip S. ; Haixun Wang
Author_Institution :
IBM T. J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
In this paper, we present an adaptive load diffusion operator to enable scalable processing of multiway windowed stream joins (MWSJs) using a cluster system. The load diffusion is achieved by a set of novel semantics-pre serving tuple routing algorithms. Different from previous work, the load diffusion operator can (1) preserve the MWSJ semantics while spreading tuples to different hosts for parallel join processing; (2) achieve fine-grained load balancing among distributed hosts; and (3) perform semantics-preserving online adaptations to maintain optimal performance in dynamic stream environments. We have implemented a prototype of the distributed MWSJ framework on top of the System S distributed stream processing system. Our experiment results based on both real data streams and synthetic workloads show that the load diffusion algorithms can efficiently scale-up the performance of MWSJ processing with low overhead.
Keywords :
distributed processing; resource allocation; adaptive load diffusion; cluster system; diffusion operator; distributed host; dynamic stream environment; fine-grained load balancing; multiway windowed stream joins; parallel join processing; semantics-preserving tuple routing; Clustering algorithms; Image analysis; Intrusion detection; Load management; Monitoring; Prototypes; Query processing; Routing; Streaming media; Video surveillance;
Conference_Titel :
Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on
Conference_Location :
Istanbul
Print_ISBN :
1-4244-0802-4
DOI :
10.1109/ICDE.2007.367860