• DocumentCode
    656177
  • Title

    Characterizing Cloud Applications on a Google Data Center

  • Author

    Sheng Di ; Kondo, Daishi ; Cappello, Franck

  • Author_Institution
    INRIA, Saclay, France
  • fYear
    2013
  • fDate
    1-4 Oct. 2013
  • Firstpage
    468
  • Lastpage
    473
  • Abstract
    In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e.g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the K-means clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.
  • Keywords
    Pareto distribution; cloud computing; computer centres; pattern clustering; resource allocation; search engines; Google applications; Google data center; Google trace; K-means clustering sets; Pareto principle; Pareto-similar distribution; cloud applications; cloud environment; execution types; resource types; resource utilization; statistics; task events; Cloud computing; Clustering algorithms; Computational modeling; Google; Measurement; Resource management; Scheduling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing (ICPP), 2013 42nd International Conference on
  • Conference_Location
    Lyon
  • ISSN
    0190-3918
  • Type

    conf

  • DOI
    10.1109/ICPP.2013.56
  • Filename
    6687380