Title :
Accelerating Batch Analytics with Residual Resources from Interactive Clouds
Author :
Clay, R. Benjamin ; Zhiming Shen ; Xiaosong Ma
Author_Institution :
Dept. of Comput. Sci., North Carolina State Univ., Raleigh, NC, USA
Abstract :
The popularity of cloud-based interactive computing services (e.g., virtual desktops) brings new management challenges. Each interactive user leaves abundant but fluctuating residual resources while being intolerant to latency, precluding the use of aggressive VM consolidation. In this paper, we present the Resource Harvester for Interactive Clouds (RHIC), an autonomous management framework that harnesses dynamic residual resources aggressively without slowing the harvested interactive services. RHIC builds ad-hoc clusters for running throughput-oriented "background" workloads using a hybrid of residual and dedicated resources. These hybrid clusters offer significant gains over normal dedicated clusters: 20-40% cost and 20-29% energy savings in our test bed. For a given background job, RHIC intelligently discovers and maintains the ideal cluster size and composition, to meet user-specified goals such as cost/energy minimization or deadlines. RHIC employs black-box workload performance modeling, requiring only system-level metrics and incorporating techniques to improve modeling accuracy with bursty and heterogeneous residual resources. We demonstrate the effectiveness and adaptivity of our RHIC prototype with two parallel data analytics frameworks, Hadoop and HBase. Our results show that RHIC finds near-ideal cluster sizes and compositions across a wide range of workload/goal combinations.
Keywords :
cloud computing; interactive systems; power aware computing; resource allocation; software performance evaluation; virtual machines; workstation clusters; HBase; Hadoop; RHIC prototype; ad-hoc clusters; aggressive VM consolidation; autonomous management framework; batch analytics acceleration; black- box workload performance modeling; cloud-based interactive computing services; dedicated resources; dynamic residual resources; energy minimization; ideal cluster composition; ideal cluster size; parallel data analytics frameworks; resource harvester for interactive clouds; system-level metrics; throughput-oriented background workloads; virtual desktops; Adaptation models; Availability; Bandwidth; Measurement; Predictive models; Productivity; Virtual machine monitors; Adaptive systems; Distributed computing; Performance analysis;
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2013 IEEE 21st International Symposium on
Conference_Location :
San Francisco, CA
DOI :
10.1109/MASCOTS.2013.63