TY - GEN
T1 - An experimental evaluation of correlated network partitions in the Coda distributed file system
AU - Lefever, Ryan M.
AU - Cukier, Michel
AU - Sanders, William H.
PY - 2003
Y1 - 2003
N2 - Experimental evaluation is an important way to assess distributed systems, and fault injection is the dominant technique in this area for the evaluation of a system's dependability. For distributed systems, network failure is an important fault model. Physical network failures often have far-reaching effects, giving rise to multiple correlated failures as seen by higher-level protocols. This paper presents an experimental evaluation, using the Loki fault injector, that provides insight into the impact that correlated network partitions have on the Coda distributed file system. In this evaluation, Loki created a network partition between two Coda file servers, during which updates were made at each server to the same replicated data volume. Upon repair of the partition, a client requested directory resolution to converge the diverging replicas. At various stages of the resolution, Loki invoked a second correlated network partition, thus allowing us to evaluate its impact on the system's correctness, performance, and availability.
KW - Coda
KW - Correlated network faults
KW - Distributed file system
KW - Experimental evaluation
KW - Fault injection
KW - Loki fault injector
KW - State-driven fault injection
UR - http://www.scopus.com/inward/record.url?scp=3242675882&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=3242675882&partnerID=8YFLogxK
U2 - 10.1109/RELDIS.2003.1238077
DO - 10.1109/RELDIS.2003.1238077
M3 - Conference contribution
AN - SCOPUS:3242675882
SN - 0769519555
T3 - Proceedings of the IEEE Symposium on Reliable Distributed Systems
SP - 273
EP - 282
BT - Proceedings - 22nd International Symposium on Reliable Distributed Systems, SRDS 2003
T2 - 22nd International Symposium on Reliable Distributed Systems, SRDS 2003
Y2 - 6 October 2003 through 8 October 2003
ER -