DocumentCode
656177
Title
Characterizing Cloud Applications on a Google Data Center
Author
Sheng Di ; Kondo, Daishi ; Cappello, Franck
Author_Institution
INRIA, Saclay, France
fYear
2013
fDate
1-4 Oct. 2013
Firstpage
468
Lastpage
473
Abstract
In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e.g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the K-means clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.
Keywords
Pareto distribution; cloud computing; computer centres; pattern clustering; resource allocation; search engines; Google applications; Google data center; Google trace; K-means clustering sets; Pareto principle; Pareto-similar distribution; cloud applications; cloud environment; execution types; resource types; resource utilization; statistics; task events; Cloud computing; Clustering algorithms; Computational modeling; Google; Measurement; Resource management; Scheduling;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Processing (ICPP), 2013 42nd International Conference on
Conference_Location
Lyon
ISSN
0190-3918
Type
conf
DOI
10.1109/ICPP.2013.56
Filename
6687380
Link To Document