TY - GEN
T1 - High accuracy failure injection in parallel and distributed systems using virtualization
AU - Hérault, Thomas
AU - Largillier, Thomas
AU - Peyronnet, Sylvain
AU - Quétier, Benjamin
AU - Cappello, Franck
AU - Jan, Mathieu
PY - 2009
Y1 - 2009
N2 - Emulation sits between simulation and experimentation to complete the set of tools available for software designers to evaluate their software and predict behavior under conditions usually unachievable in a laboratory experiment. It consists in running the real application in an emulated environment. Thus, it behaves more realistically than a simulation, but under a controlled and reproducible environment, more suitable for behavior analysis. In this paper, we propose an emulation platform for parallel and distributed systems where both the machines and the network are virtualized at a low level. We demonstrate that the use of virtual machines allows us to test highly accurate failure injection by "destroying" virtual machines. Failure accuracy is a criteria that demonstrates how realistic a fault is. The platform accuracy is evaluated using Pastry, a fault-tolerant distributed hash-table.
AB - Emulation sits between simulation and experimentation to complete the set of tools available for software designers to evaluate their software and predict behavior under conditions usually unachievable in a laboratory experiment. It consists in running the real application in an emulated environment. Thus, it behaves more realistically than a simulation, but under a controlled and reproducible environment, more suitable for behavior analysis. In this paper, we propose an emulation platform for parallel and distributed systems where both the machines and the network are virtualized at a low level. We demonstrate that the use of virtual machines allows us to test highly accurate failure injection by "destroying" virtual machines. Failure accuracy is a criteria that demonstrates how realistic a fault is. The platform accuracy is evaluated using Pastry, a fault-tolerant distributed hash-table.
KW - Experimentation reliability measurement
UR - http://www.scopus.com/inward/record.url?scp=84858644930&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84858644930&partnerID=8YFLogxK
U2 - 10.1145/1531743.1531774
DO - 10.1145/1531743.1531774
M3 - Conference contribution
AN - SCOPUS:84858644930
SN - 9781605584133
T3 - Proceedings of the 6th ACM Conference on Computing Frontiers, CF 2009
SP - 193
EP - 196
BT - Proceedings of the 6th ACM Conference on Computing Frontiers, CF 2009
T2 - 6th ACM Conference on Computing Frontiers, CF 2009
Y2 - 18 May 2009 through 20 May 2009
ER -