TY - JOUR
T1 - Dependability evaluation using composed SAN-based reward models
AU - Sanders, William H.
AU - Malhis, Luai M.
N1 - Funding Information:
* This work was funded in part by the Digital Equipment Corporation Faculty Program: Incentives for Excellence, Intel Corporation, and US West Advanced Technologies.
Copyright:
Copyright 2014 Elsevier B.V., All rights reserved.
PY - 1992/7
Y1 - 1992/7
N2 - Dependability evaluation is an important, but difficult, aspect of the design of fault-tolerant parallel and distributed computing systems. One possible technique is to use Markov models but, if applied directly to realistic designs, this often results in large and intractable models. Many authors have investigated methods to avoid this explosive state-space growth, but have typically either solved the problem for a specific system design, or required manipulation of the model at the state-space level. Stochastic activity networks (SANs), a stochastic extension of Petri nets, together with recently developed reduced base model construction techniques, have the potential to avoid this state-space growth at the SAN level for many parallel and distributed systems. This paper investigates this claim by considering their application to three different systems: a fault-tolerant parallel computing system, a distributed database architecture, and a multiprocessor-multimemory system. We show that this method does indeed result in tractable Markov models for these systems, and argue that it can be applied to the dependability evaluation of many parallel and distributed systems.
AB - Dependability evaluation is an important, but difficult, aspect of the design of fault-tolerant parallel and distributed computing systems. One possible technique is to use Markov models but, if applied directly to realistic designs, this often results in large and intractable models. Many authors have investigated methods to avoid this explosive state-space growth, but have typically either solved the problem for a specific system design, or required manipulation of the model at the state-space level. Stochastic activity networks (SANs), a stochastic extension of Petri nets, together with recently developed reduced base model construction techniques, have the potential to avoid this state-space growth at the SAN level for many parallel and distributed systems. This paper investigates this claim by considering their application to three different systems: a fault-tolerant parallel computing system, a distributed database architecture, and a multiprocessor-multimemory system. We show that this method does indeed result in tractable Markov models for these systems, and argue that it can be applied to the dependability evaluation of many parallel and distributed systems.
UR - http://www.scopus.com/inward/record.url?scp=35048841008&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35048841008&partnerID=8YFLogxK
U2 - 10.1016/0743-7315(92)90006-9
DO - 10.1016/0743-7315(92)90006-9
M3 - Article
AN - SCOPUS:35048841008
SN - 0743-7315
VL - 15
SP - 238
EP - 254
JO - Journal of Parallel and Distributed Computing
JF - Journal of Parallel and Distributed Computing
IS - 3
ER -