• DocumentCode
    714243
  • Title

    Spotgres - parallel data analytics on Spot Instances

  • Author

    Binnig, Carsten ; Salama, Abdallah ; Zamanian, Erfan ; El-Hindi, Muhammad ; Feil, Sebastian ; Ziegler, Tobias

  • Author_Institution
    Cooperative State Univ. of Baden-Wuerttemberg, Mannheim, Germany
  • fYear
    2015
  • fDate
    13-17 April 2015
  • Firstpage
    14
  • Lastpage
    21
  • Abstract
    Market-based IaaS offers such as Amazon´s EC2 Spot Instances represent a cost-efficient way to operate a cluster. Compared to traditional IaaS offers which follow a fixed pricing scheme, the per hour price of Spot Instances changes dynamically, whereas the Spot price is often significantly less when compared to On-demand and even the Reserved Instances. When deploying a Parallel Data-Processing Engine (PDE) on a cluster of Spot Instances a major obstacle is to find a bidding strategy that is optimal for a given workload and satisfies user constraints such as the maximal budget. Moreover, another obstacle is that existing PDEs implement rigid fault-tolerance schemes which do not adapt to different failure rates resulting from different bidding strategies. In this paper, we present a novel PDE called Spotgres that tackles these issues. Spotgres extends a typical PDE architecture by (1) a constraint-based bid advisor which finds an optimal cluster configuration (i.e., a set of bids on Spot Instances) and (2) a cost-based fault-tolerance scheme that takes various parameters (such as the mean time between failures and query statistics) into account to efficiently execute analytical queries over the set of Spot Instances that have a varying failure rate.
  • Keywords
    cloud computing; data analysis; Amazon EC2 Spot Instances; PDE; Spotgres; constraint-based bid advisor; cost-based fault-tolerance scheme; market-based IaaS; parallel data analytics; parallel data-processing engine; Fault tolerance; Fault tolerant systems; History; Linear programming; Monitoring; Optimization; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering Workshops (ICDEW), 2015 31st IEEE International Conference on
  • Conference_Location
    Seoul
  • Type

    conf

  • DOI
    10.1109/ICDEW.2015.7129538
  • Filename
    7129538