"Cool" Load balancing for high performance computing data centers

Osman Sarood, Phil Miller, Ehsan Totoni, Laxmikant V. Kalé

Research output: Contribution to journalArticlepeer-review

Abstract

As we move to exascale machines, both peak power demand and total energy consumption have become prominent challenges. A significant portion of that power and energy consumption is devoted to cooling, which we strive to minimize in this work. We propose a scheme based on a combination of limiting processor temperatures using dynamic voltage and frequency scaling (DVFS) and frequency-aware load balancing that reduces cooling energy consumption and prevents hot spot formation. Our approach is particularly designed for parallel applications, which are typically tightly coupled, and tries to minimize the timing penalty associated with temperature control. This paper describes results from experiments using five different Charm++ and MPI applications with a range of power and utilization profiles. They were run on a 32-node (128-core) cluster with a dedicated air conditioning unit. The scheme is assessed based on three metrics: the ability to control processors' temperature and hence avoid hot spots, minimization of timing penalty, and cooling energy savings. Our results show cooling energy savings of up to 63 percent, with a timing penalty of only 2-23 percent.

Original languageEnglish (US)
Article number6226358
Pages (from-to)1752-1764
Number of pages13
JournalIEEE Transactions on Computers
Volume61
Issue number12
DOIs
StatePublished - 2012

Keywords

  • DVFS
  • Green IT
  • cooling energy
  • load balancing
  • temperature aware

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of '"Cool" Load balancing for high performance computing data centers'. Together they form a unique fingerprint.

Cite this