DocumentCode
3738334
Title
An enforcement of real time scheduling in Spark Streaming
Author
Xinyi Liao; Zhiwei Gao; Weixing Ji; Yizhuo Wang
Author_Institution
School of Computer Science, Beijing Institute of Technology, 100081, USA
fYear
2015
Firstpage
1
Lastpage
6
Abstract
With the exponential growth in continuous data streams, real time streaming processing has been gaining a lot of popularity. Spark Streaming is one of the open source frameworks for reliable, high-throughput and low latency stream processing. Though it is a near real time stream processing framework running on commodity hardware, real time event processing is not guaranteed in its scheduling system. Profiling results indicate that the total delay time of events with unstable inputs is more volatile and presents big fluctuations. In this paper, we propose a simple, yet effective scheduling strategy to reduce the worst case event processing time by dynamic adjusting the time window of batch intervals. It is a real time enhancement to Spark Streaming based on Spark´s framework. The proposed strategy is evaluated using two streaming benchmarks and our preliminary results demonstrate the feasibility of our approach with unstable event streams.
Keywords
"Delays","Sparks","Real-time systems","Scheduling","Processor scheduling","Data processing","Reliability"
Publisher
ieee
Conference_Titel
Green Computing Conference and Sustainable Computing Conference (IGSC), 2015 Sixth International
Type
conf
DOI
10.1109/IGCC.2015.7393730
Filename
7393730
Link To Document