Performance analysis of two time-based coordinated checkpointing protocols

G. P. Kavanaugh, W. H. Sanders

Research output: Contribution to conference (paper)

Abstract

Time-based checkpointing protocols are a recently proposed way to improve a system's dependability. They claim to have the advantages of coordinated protocols without the normal costs of coordination. This paper investigates that claim, by analyzing and comparing two time-based checkpointing protocols. The analysis is performed by determining the forward progress of a system using each protocol, and it is described in such a way as to be easily modifiable for other time-based protocols. By carefully analyzing the behavior of each protocol between renewal points, we are able to obtain a closed-form expression for the forward progress of the two protocols considered. We also determine the checkpoint interval value that will maximize forward progress. A validation of the analytical model is then performed via a detailed simulation. The results obtained from the model show the advantages and disadvantages of each protocol.

Original language: English (US)
Pages: 194-201
Number of pages: 8
State: Published - Dec 1 1997
Event: Proceedings of the 1997 Pacific Rim International Symposium on Fault-Tolerant Systems, PRFTS - Taipei, Taiwan
Duration: Dec 15 1997 to Dec 16 1997

Fingerprint

Analytical models
Costs

ASJC Scopus subject areas

  • Engineering (all)

Cite this

Kavanaugh, G. P., & Sanders, W. H. (1997). Performance analysis of two time-based coordinated checkpointing protocols (pp. 194-201). Paper presented at Proceedings of the 1997 Pacific Rim International Symposium on Fault-Tolerant Systems, PRFTS, Taipei, Taiwan.

@conference{699b4bcfcdd14194a76b7693ef8c8559,
title = "Performance analysis of two time-based coordinated checkpointing protocols",
abstract = "Time-based checkpointing protocols are a recently proposed way to improve a system's dependability. They claim to have the advantages of coordinated protocols without the normal costs of coordination. This paper investigates that claim, by analyzing and comparing two time-based checkpointing protocols. The analysis is performed by determining the forward progress of a system using each protocol, and it is described in such a way as to be easily modifiable for other time-based protocols. By carefully analyzing the behavior of each protocol between renewal points, we are able to obtain a closed-form expression for the forward progress of the two protocols considered. We also determine the checkpoint interval value that will maximize forward progress. A validation of the analytical model is then performed via a detailed simulation. The results obtained from the model show the advantages and disadvantages of each protocol.",
author = "Kavanaugh, {G. P.} and Sanders, {W. H.}",
year = "1997",
month = "12",
day = "1",
language = "English (US)",
pages = "194--201",
note = "Proceedings of the 1997 Pacific Rim International Symposium on Fault-Tolerant Systems, PRFTS ; Conference date: 15-12-1997 Through 16-12-1997",

}

TY - CONF

T1 - Performance analysis of two time-based coordinated checkpointing protocols

AU - Kavanaugh, G. P.

AU - Sanders, W. H.

PY - 1997/12/1

Y1 - 1997/12/1

N2 - Time-based checkpointing protocols are a recently proposed way to improve a system's dependability. They claim to have the advantages of coordinated protocols without the normal costs of coordination. This paper investigates that claim, by analyzing and comparing two time-based checkpointing protocols. The analysis is performed by determining the forward progress of a system using each protocol, and it is described in such a way as to be easily modifiable for other time-based protocols. By carefully analyzing the behavior of each protocol between renewal points, we are able to obtain a closed-form expression for the forward progress of the two protocols considered. We also determine the checkpoint interval value that will maximize forward progress. A validation of the analytical model is then performed via a detailed simulation. The results obtained from the model show the advantages and disadvantages of each protocol.

UR - http://www.scopus.com/inward/record.url?scp=0031341097&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031341097&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0031341097

SP - 194

EP - 201

ER -