TY - GEN
T1 - Characterizing cloud applications on a Google data center
AU - Di, Sheng
AU - Kondo, Derrick
AU - Cappello, Franck
PY - 2013
Y1 - 2013
N2 - In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e.g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the Kmeans clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.
AB - In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e.g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the Kmeans clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.
UR - http://www.scopus.com/inward/record.url?scp=84893269298&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893269298&partnerID=8YFLogxK
U2 - 10.1109/ICPP.2013.56
DO - 10.1109/ICPP.2013.56
M3 - Conference contribution
AN - SCOPUS:84893269298
SN - 9780769551173
T3 - Proceedings of the International Conference on Parallel Processing
SP - 468
EP - 473
BT - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 42nd Annual International Conference on Parallel Processing, ICPP 2013
Y2 - 1 October 2013 through 4 October 2013
ER -