An experimental evaluation of correlated network partitions in the Coda distributed file system

Ryan M. Lefever, Michel Cukier, William H. Sanders

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Experimental evaluation is an important way to assess distributed systems, and fault injection is the dominant technique in this area for the evaluation of a system's dependability. For distributed systems, network failure is an important fault model. Physical network failures often have far-reaching effects, giving rise to multiple correlated failures as seen by higher-level protocols. This paper presents an experimental evaluation, using the Loki fault injector, that provides insight into the impact that correlated network partitions have on the Coda distributed file system. In this evaluation, Loki created a network partition between two Coda file servers, during which updates were made at each server to the same replicated data volume. Upon repair of the partition, a client requested directory resolution to converge the diverging replicas. At various stages of the resolution, Loki invoked a second correlated network partition, thus allowing us to evaluate its impact on the system's correctness, performance, and availability.

Original language: English (US)
Title of host publication: Proceedings - 22nd International Symposium on Reliable Distributed Systems, SRDS 2003
Pages: 273-282
Number of pages: 10
DOIs: https://doi.org/10.1109/RELDIS.2003.1238077
State: Published - Dec 1 2003
Event: 22nd International Symposium on Reliable Distributed Systems, SRDS 2003 - Florence, Italy
Duration: Oct 6 2003 to Oct 8 2003

Publication series

Name: Proceedings of the IEEE Symposium on Reliable Distributed Systems
ISSN (Print): 1060-9857

Other

Other: 22nd International Symposium on Reliable Distributed Systems, SRDS 2003
Country: Italy
City: Florence
Period: 10/6/03 to 10/8/03

Keywords

  • Coda
  • Correlated network faults
  • Distributed file system
  • Experimental evaluation
  • Fault injection
  • Loki fault injector
  • State-driven fault injection

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Hardware and Architecture
  • Computer Networks and Communications

  • Cite this

    Lefever, R. M., Cukier, M., & Sanders, W. H. (2003). An experimental evaluation of correlated network partitions in the Coda distributed file system. In Proceedings - 22nd International Symposium on Reliable Distributed Systems, SRDS 2003 (pp. 273-282). (Proceedings of the IEEE Symposium on Reliable Distributed Systems). https://doi.org/10.1109/RELDIS.2003.1238077