Title :
Evolutionary Scheduling of Dynamic Multitasking Workloads for Big-Data Analytics in Elastic Cloud
Author :
Fan Zhang ; Junwei Cao ; Wei Tan ; Khan, Samee U. ; Keqin Li ; Zomaya, Albert Y.
Author_Institution :
Kavli Inst. for Astrophys. & Space Res., Massachusetts Inst. of Technol., Cambridge, MA, USA
Abstract :
Scheduling of dynamic and multitasking workloads for big-data analytics is a challenging issue, as it requires a significant amount of parameter sweeping and iterations. Therefore, real-time scheduling becomes essential to increase the throughput of many-task computing. The difficulty lies in obtaining a series of optimal yet responsive schedules. In dynamic scenarios, such as virtual clusters in cloud, scheduling must be processed fast enough to keep pace with the unpredictable fluctuations in the workloads to optimize the overall system performance. In this paper, ordinal optimization using rough models and fast simulation is introduced to obtain suboptimal solutions in a much shorter timeframe. While the scheduling solution for each period may not be the best, ordinal optimization can be processed fast in an iterative and evolutionary way to capture the details of big-data workload dynamism. Experimental results show that our evolutionary approach compared with existing methods, such as Monte Carlo and Blind Pick, can achieve higher overall average scheduling performance, such as throughput, in real-world applications with dynamic workloads. Furthermore, performance improvement is seen by implementing an optimal computing budget allocating method that smartly allocates computing cycles to the most promising schedules.
Keywords :
Big Data; Monte Carlo methods; cloud computing; evolutionary computation; iterative methods; optimisation; scheduling; Monte Carlo; average scheduling performance; big-data analytics; big-data workload dynamism; blind pick; computing cycles; dynamic multitasking workloads; elastic cloud; evolutionary approach; evolutionary scheduling; iterations; many-task computing; optimal computing budget allocating method; ordinal optimization; parameter sweeping; rough models; virtual clusters; Dynamic scheduling; Monte Carlo methods; Optimization; Processor scheduling; Schedules; Throughput; Big-data; cloud computing; evolutionary ordinal optimization; multitasking workload; virtual clusters;
Journal_Title :
Emerging Topics in Computing, IEEE Transactions on
DOI :
10.1109/TETC.2014.2348196