Protocol-aware recovery for consensus-based storage

Ramnatthan Alagappan, Aishwarya Ganesan, Eric Lee, Aws Albarghouthi, Vijay Chidambaram, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

We introduce protocol-aware recovery (PAR), a new approach that exploits protocol-specific knowledge to correctly recover from storage faults in distributed systems. We demonstrate the efficacy of PAR through the design and implementation of corruption-tolerant replication (CTRL), a PAR mechanism specific to replicated state machine (RSM) systems. We experimentally show that the CTRL versions of two systems, LogCabin and ZooKeeper, safely recover from storage faults and provide high availability, while the unmodified versions can lose data or become unavailable. We also show that the CTRL versions have little performance overhead.

Original language: English (US)
Title of host publication: Proceedings of the 16th USENIX Conference on File and Storage Technologies, FAST 2018
Publisher: USENIX Association
Pages: 15-31
Number of pages: 17
ISBN (Electronic): 9781931971423
State: Published - 2018
Externally published: Yes
Event: 16th USENIX Conference on File and Storage Technologies, FAST 2018 - Oakland, United States
Duration: Feb 12 2018 to Feb 15 2018

Conference

Conference: 16th USENIX Conference on File and Storage Technologies, FAST 2018
Country/Territory: United States
City: Oakland
Period: 2/12/18 to 2/15/18

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Computer Networks and Communications
