Title :
Parallelizing windowed stream joins in a shared-nothing cluster
Author :
Chakraborty, Arpan ; Singh, Ashutosh
Author_Institution :
Sch. of Inf. & Comput. Sci., Indiana Univ., Bloomington, IN, USA
Abstract :
The availability of a large number of processing nodes in a parallel and distributed computing environment enables sophisticated real time processing over high speed data streams, as required by many emerging applications. Sliding window stream joins are among the most important operators in a stream processing system. In this paper, we consider the issue of parallelizing a sliding window stream join operator over a shared nothing cluster. We propose a Bulk Synchronous Processing (BSP) framework, based on a fixed or predefined sequence of communication, to distribute the join processing loads over a shared-nothing cluster. We consider various processing and communication overheads while scaling over a large number of nodes, and propose solution methodologies to cope with the issues.We implement the algorithm over a cluster using a message passing system, and present the experimental results showing the effectiveness of the join processing algorithm.
Keywords :
application program interfaces; message passing; parallel processing; BSP framework; bulk synchronous processing framework; communication overhead; distributed computing environment; join processing algorithm; message passing system; parallel computing environment; parallelization; processing nodes; processing overhead; shared nothing cluster; shared-nothing cluster; stream processing system; windowed stream joins; Databases; Delays; Heuristic algorithms; Parallel processing; Partitioning algorithms; Scalability; Tuning;
Conference_Titel :
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location :
Indianapolis, IN
DOI :
10.1109/CLUSTER.2013.6702636