ReViveI/O: Efficient handling of I/O in highly-available rollback-recovery servers

Jun Nakano, Pablo Montesinos, Kourosh Gharachorloo, Josep Torrellas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The increasing demand for reliable computers has led to proposals for hardware-assisted rollback of memory state. Such approach promises major reductions in Mean Time To Repair (MTTR). The benefits are especially compelling for database servers, where existing recovery software typically leads to down-times of tens of minutes. Unfortunately, adoption of such proposals is hindered by the lack of efficient mechanisms for I/O recovery. This paper presents and evaluates ReViveI/O, a scheme for I/O undo and redo that is compatible with mechanisms for hardware-assisted rollback of memory state. We have fully implemented a Linux-based prototype that shows that low-overhead, low-MTTR recovery of I/O is feasible. For 20-120 ms between checkpoints, a throughput-oriented workload such as TPC-C has negligible over-head. Moreover, for 50 ms or less between checkpoints, the response time of a latency-bound workload such as WebStone remains tolerable. In all cases, the recovery time of ReViveI/O is practically negligible. The result is a cost-effective highly-available server.

Original languageEnglish (US)
Title of host publicationProceedings - Twelfth International Symposium on High-Performance Computer Architecture, 2006
Pages203-214
Number of pages12
DOIs
StatePublished - 2006
EventTwelfth International Symposium on High-Performance Computer Architecture, 2006 - Austin, TX, United States
Duration: Feb 11 2006Feb 15 2006

Publication series

NameProceedings - International Symposium on High-Performance Computer Architecture
Volume2006
ISSN (Print)1530-0897

Other

OtherTwelfth International Symposium on High-Performance Computer Architecture, 2006
Country/TerritoryUnited States
CityAustin, TX
Period2/11/062/15/06

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'ReViveI/O: Efficient handling of I/O in highly-available rollback-recovery servers'. Together they form a unique fingerprint.

Cite this