• DocumentCode
    2784654
  • Title

    A Performance Interference Model for Managing Consolidated Workloads in QoS-Aware Clouds

  • Author

    Zhu, Qian ; Tung, Teresa

  • Author_Institution
    Accenture Technol. Labs., San Jose, CA, USA
  • fYear
    2012
  • fDate
    24-29 June 2012
  • Firstpage
    170
  • Lastpage
    179
  • Abstract
    Cloud computing offers users the ability to access large pools of computational and storage resources on-demand without the burden of managing and maintaining their own IT assets. Today´s cloud providers charge users based upon the amount of resources used or reserved, with only minimal guarantees of the quality-of-service (QoS) experienced byte users applications. As virtualization technologies proliferate among cloud providers, consolidating multiple user applications onto multi-core servers increases revenue and improves resource utilization. However, consolidation introduces performance interference between co-located workloads, which significantly impacts application QoS. A critical requirement for effective consolidation is to be able to predict the impact of application performance in the presence of interference from on-chip resources, e.g., CPU and last-level cache (LLC)/memory bandwidth sharing, to storage devices and network bandwidth contention. In this work, we propose an interference model which predicts the application QoS metric. The key distinctive feature is the consideration of time-variant inter-dependency among different levels of resource interference. We use applications from a test suite and SPECWeb2005 to illustrate the effectiveness of our model and an average prediction error of less than 8% is achieved. Furthermore, we demonstrate using the proposed interference model to optimize the cloud provider´s metric (here the number of successfully executed applications) to realize better workload placement decisions and thereby maintaining the user´s application QoS.
  • Keywords
    cache storage; cloud computing; quality of service; resource allocation; virtual machines; CPU; IT asset; QoS-aware cloud; SPECWeb2005; application QoS metric prediction; cloud computing; cloud providers; colocated workload; computational resource; consolidated workload management; last-level cache sharing; memory bandwidth sharing; multicore server; network bandwidth; on-chip resources; on-demand resource access; performance interference model; quality-of-service; resource interference; resource utilization; storage devices; storage resource; time-variant interdependency; user applications; virtualization technology; workload placement decision; Computational modeling; Degradation; Hidden Markov models; Interference; Measurement; Quality of service; Servers; Cloud computing; QoS-aware; performance interference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2012 IEEE 5th International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    2159-6182
  • Print_ISBN
    978-1-4673-2892-0
  • Type

    conf

  • DOI
    10.1109/CLOUD.2012.25
  • Filename
    6253503