• DocumentCode
    604046
  • Title
    Performance and Reliability Effects of Multi-tier Bidding on MapReduce in Auction-Based Clouds
  • Author
    Taifi, M. ; Shi, J.Y.
  • Author_Institution
    Comput. Sci. Dept., Temple Univ., Philadelphia, PA, USA
  • fYear
    2013
  • fDate
    25-28 March 2013
  • Firstpage
    61
  • Lastpage
    71
  • Abstract
    Hadoop has become a central big data processing framework in today's cloud environments. Ensuring the good performance and cost effectiveness of Hadoop is crucial for the numerous applications that rely on it. In this paper we analyze Hadoop's performance in a multi-tier market-oriented cloud infrastructure known as Spot Instances. Amazon Spot Instances (SIs) are designed to deliver a cheap but transient alternative to fixed-cost On-Demand Instances (ODIs). Recently, AWS introduced SIs in their managed Elastic MapReduce offering. This managed framework lets users design a multi-tier Hadoop architecture using fine-grained controls to define the instance types both in terms of capacity, i.e. compute/storage/network, and in terms of cost, i.e. ODI vs. SI. The performance effects of such fine-grained configurations are not yet well understood. First, we analyze a set of cluster configurations that can significantly affect both the running time and the cost of such cloud Hadoop clusters. Second, we examine Hadoop's fault tolerance mechanisms and show the inadequacy of these mechanisms for multi-tier bidding architectures. Third, we discuss directions for making the Hadoop framework more market-aware without losing its focus on extreme scalability.
  • Keywords
    cloud computing; data handling; fault tolerant computing; Amazon Spot Instances; Hadoop fault tolerance mechanism; Hadoop framework; Hadoop performance analysis; MapReduce; ODI; auction-based cloud; central big data processing framework; cloud Hadoop cluster; cloud environment; cluster configuration; cost effectiveness; elastic Map Reduce offering; fine grained control; fixed cost on-demand instance; multitier Hadoop architecture; multitier bidding architecture; multitier market-oriented cloud infrastructure; Availability; Computer architecture; Fault tolerance; Fault tolerant systems; Runtime; Silicon; Auction-based Clouds; Failures; Fault Tolerance; Hadoop; Performance of Systems;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Service Oriented System Engineering (SOSE), 2013 IEEE 7th International Symposium on
  • Conference_Location
    Redwood City
  • Print_ISBN
    978-1-4673-5659-6
  • Type
    conf
  • DOI
    10.1109/SOSE.2013.13
  • Filename
    6525506