DocumentCode :
3718823
Title :
Spark on entropy: A reliable & efficient scheduler for low-latency parallel jobs in heterogeneous cloud
Author :
Huankai Chen;Frank Z Wang
Author_Institution :
Future Computing Group, School of Computing, University of Kent, Canterbury, UK
fYear :
2015
Firstpage :
708
Lastpage :
713
Abstract :
In heterogeneous cloud, the provision of quality of service (QoS) guarantees for on-line parallel analysis jobs is much more challenging than off-line ones, mainly due to the many involved parameters, unstable resource performance, various job pattern and dynamic query workload. In this paper we propose an entropy-based scheduling strategy for running the on-line parallel analysis as a service more reliable and efficient, and implement the proposed idea in Spark. Entropy, as a measure of the degree of disorder in a system, is an indicator of a system´s tendency to progress out of order and into a chaotic condition, and it can thus serve to measure a cloud resource´s reliability for jobs scheduling. The key idea of our Entropy Scheduler is to construct the new resource entropy metric and schedule tasks according to the resources ranking with the help of the new metric so as to provide QoS guarantees for on-line Spark analysis jobs. Experiments demonstrate that our approach significantly reduces the average query response time by 15% - 20% and standard deviation by 30% - 45% compare with the native Fair Scheduler in Spark.
Keywords :
"Sparks","Entropy","Cloud computing","Reliability","Dynamic scheduling","Job shop scheduling"
Publisher :
ieee
Conference_Titel :
Local Computer Networks Conference Workshops (LCN Workshops), 2015 IEEE 40th
Type :
conf
DOI :
10.1109/LCNW.2015.7365918
Filename :
7365918
Link To Document :
بازگشت