POSTER: Energy-performance tradeoffs in multilevel checkpoint strategies

Leonardo A.Bautista Gomez, Prasanna Balaprakash, Mohamed Slim Bouguerra, Stefan M. Wild, Franck Cappello, Paul D. Hovland

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Increased complexity of computer architectures, consideration of power constraints, and expected failure rates of hardware components make the design and analysis of energy-efficient fault-tolerance schemes an increasingly challenging and important task. We develop run-time and study FTI, a multilevel checkpoint library, on an IBM Blue Gene/Q. We show that FTI has a low energy footprint and that, consequently optimal checkpoint-interval values with respect to time and energy are similar.

Original languageEnglish (US)
Title of host publication2014 IEEE International Conference on Cluster Computing, CLUSTER 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages278-279
Number of pages2
ISBN (Electronic)9781479955480
DOIs
StatePublished - Nov 26 2014
Event16th IEEE International Conference on Cluster Computing, CLUSTER 2014 - Madrid, Spain
Duration: Sep 22 2014Sep 26 2014

Publication series

Name2014 IEEE International Conference on Cluster Computing, CLUSTER 2014

Other

Other16th IEEE International Conference on Cluster Computing, CLUSTER 2014
Country/TerritorySpain
CityMadrid
Period9/22/149/26/14

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'POSTER: Energy-performance tradeoffs in multilevel checkpoint strategies'. Together they form a unique fingerprint.

Cite this