مرکز منطقه ای اطلاع رساني علوم و فناوري - Banking on Decoupling: Budget-Driven Sustainability for HPC Applications on EC2 Spot Instances

DocumentCode :

3144794

Title :

Banking on Decoupling: Budget-Driven Sustainability for HPC Applications on EC2 Spot Instances

Author :

Taifi, M.

Author_Institution :

Comput. & Inf. Sci. Dept., Temple Univ., Philadelphia, PA, USA

fYear :

2012

fDate :

8-11 Oct. 2012

Firstpage :

442

Lastpage :

447

Abstract :

Cloud providers are auctioning their excess capacity using dynamically priced virtual instances. These spot instances provide significant savings compared to on-demand or fixed price instances. The users willing to use these resources are asked to provide a maximum bid price per hour, and the cloud provider runs the instances as long as the market price is below the user´s bid price. By using such resources, the users are exposed explicitly to failures and need to adapt their applications to provide some level of fault tolerance. In this paper we expose the effect of bidding in the case of virtual HPC clusters composed of spot instances. We describe the interesting effect of uniform versus non-uniform bidding, in terms of failure rate and failure model. We propose an initial attempt to deal with the problem of predicting the runtime of a parallel application under various bidding strategies and various system parameters. We describe the relationship between bidding strategies and programming models. We build a preliminary optimization model that uses real price traces from Amazon Web Services as inputs, as well as instrumented values related to the processing and network capacities of clusters instances on the EC2 services. Our results show preliminary insights into the relationship between non-uniform bidding and application scaling strategies.

Keywords :

Web services; budgeting; fault tolerance; pricing; sustainable development; Amazon Web service; EC2 spot instances; HPC application; application scaling strategy; banking; bidding strategy; budget driven sustainability; cloud providers; decoupling; excess capacity; failure model; failure rate; fault tolerance; fixed price instance; maximum bid price; nonuniform bidding; on demand; parallel application; preliminary optimization model; priced virtual instance; programming model; virtual HPC clusters; Approximation methods; Computational modeling; Fault tolerance; Fault tolerant systems; Programming; Runtime; Timing; Auction-based cloud computing; Cloud virtual clusters; Cloud-based Fault Tolerance; Cost-aware Optimization models; Decoupling Parallel Programming Models; Spot Instances;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Reliable Distributed Systems (SRDS), 2012 IEEE 31st Symposium on

Conference_Location :

Irvine, CA

ISSN :

1060-9857

Print_ISBN :

978-1-4673-2397-0

Type :

conf

DOI :

10.1109/SRDS.2012.11

Filename :

6424892

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3144794