• DocumentCode
    3738334
  • Title

    An enforcement of real time scheduling in Spark Streaming

  • Author

    Xinyi Liao; Zhiwei Gao; Weixing Ji; Yizhuo Wang

  • Author_Institution
    School of Computer Science, Beijing Institute of Technology, 100081, USA
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    With the exponential growth in continuous data streams, real time streaming processing has been gaining a lot of popularity. Spark Streaming is one of the open source frameworks for reliable, high-throughput and low latency stream processing. Though it is a near real time stream processing framework running on commodity hardware, real time event processing is not guaranteed in its scheduling system. Profiling results indicate that the total delay time of events with unstable inputs is more volatile and presents big fluctuations. In this paper, we propose a simple, yet effective scheduling strategy to reduce the worst case event processing time by dynamic adjusting the time window of batch intervals. It is a real time enhancement to Spark Streaming based on Spark´s framework. The proposed strategy is evaluated using two streaming benchmarks and our preliminary results demonstrate the feasibility of our approach with unstable event streams.
  • Keywords
    "Delays","Sparks","Real-time systems","Scheduling","Processor scheduling","Data processing","Reliability"
  • Publisher
    ieee
  • Conference_Titel
    Green Computing Conference and Sustainable Computing Conference (IGSC), 2015 Sixth International
  • Type

    conf

  • DOI
    10.1109/IGCC.2015.7393730
  • Filename
    7393730