An object-oriented testbed for the evaluation of checkpointing and recovery systems

B. Ramamurthy, S. J. Upadhyaya, R. K. Iyer

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The paper presents the design and development of an object-oriented testbed for the simulation and analysis of checkpointing and recovery schemes in distributed systems. An important contribution of the testbed is a unified environment that provides a set of specialized components for easy and detailed simulation of checkpointing and recovery schemes. The testbed allows a designer to mix and match different components, either to study the effectiveness of a particular scheme or to freely experiment with hybrid designs before the actual implementation. The testbed also facilitates the evaluation of interdependencies among various parameters, such as communication and application dynamics, and their effect on the performance of checkpointing and recovery schemes. The testbed is implemented as an extension of DEPEND, an integrated design and fault-injection environment; unlike existing simulation tools, this provides for unique system-level dependability analysis under realistic fault conditions. The authors illustrate the versatility of the testbed with four diverse applications, ranging from a comparison of the performance of two checkpointing and recovery schemes to a study of the effect of checkpoint size.

Original language: English (US)
Title of host publication: Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 194-203
Number of pages: 10
ISBN (Electronic): 0818678313, 9780818678318
State: Published - 1997
Event: 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997 - Seattle, United States
Duration: Jun 24 1997 → Jun 27 1997

Publication series

Name: Digest of Papers - 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997

Other

Other: 27th Annual International Symposium on Fault-Tolerant Computing, FTCS 1997
Country/Territory: United States
City: Seattle
Period: 6/24/97 → 6/27/97

ASJC Scopus subject areas

  • Computer Science Applications
  • Hardware and Architecture
  • Software
  • Safety, Risk, Reliability and Quality
