Staggered consistent checkpointing

Nitin H. Vaidya

Research output: Contribution to journalArticlepeer-review

Abstract

A consistent checkpointing algorithm saves a consistent view of a distributed application's state on stable storage. The traditional consistent checkpointing algorithms require different processes to save their state at about the same time. This causes contention for the stable storage, potentially resulting in large overheads. Staggering the checkpoints taken by various processes can reduce checkpoint overhead. This paper presents a simple approach to arbitrarily stagger the checkpoints. Our approach requires that the processes take consistent logical checkpoints, as compared to consistent physical checkpoints enforced by existing algorithms. Experimental results on nCube-2 are presented.

Original languageEnglish (US)
Pages (from-to)694-702
Number of pages9
JournalIEEE Transactions on Parallel and Distributed Systems
Volume10
Issue number7
DOIs
StatePublished - 1999
Externally publishedYes

ASJC Scopus subject areas

  • Signal Processing
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Staggered consistent checkpointing'. Together they form a unique fingerprint.

Cite this