Abstract

This paper presents a distributed fault injection and monitoring environment (DEFINE) as a tool to evaluate system dependability, to investigate fault propagation, and to validate fault-tolerant mechanisms. DEFINE can inject both hardware faults (hardware-induced software errors) and software faults into any process running in a distributed system, either in user mode or in supervisor mode, and monitor the fault impact and propagation in software systems and among machines. It employs two fault injection techniques: (i) using hardware clock interrupts to control the time of fault injection and activation, and (ii) using software traps to inject all the faults except communication faults and memory faults in the data/stack segment. Experiments on six Sun SPARCstations to study the system behavior under faults are conducted to demonstrate the application of DEFINE.

Original languageEnglish (US)
Pages252-259
Number of pages8
StatePublished - Dec 1 1995
EventProceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems - Galveston, TX, USA
Duration: Jun 13 1994Jun 14 1994

Other

OtherProceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems
CityGalveston, TX, USA
Period6/13/946/14/94

Fingerprint

Hardware
Monitoring
Supervisory personnel
Sun
Clocks
Chemical activation
Data storage equipment
Communication
Experiments

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Kao, W. L., & Iyer, R. K. (1995). DEFINE: a distributed fault injection and monitoring environment. 252-259. Paper presented at Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems, Galveston, TX, USA, .

DEFINE : a distributed fault injection and monitoring environment. / Kao, Wei lun; Iyer, Ravishankar K.

1995. 252-259 Paper presented at Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems, Galveston, TX, USA, .

Research output: Contribution to conferencePaper

Kao, WL & Iyer, RK 1995, 'DEFINE: a distributed fault injection and monitoring environment', Paper presented at Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems, Galveston, TX, USA, 6/13/94 - 6/14/94 pp. 252-259.
Kao WL, Iyer RK. DEFINE: a distributed fault injection and monitoring environment. 1995. Paper presented at Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems, Galveston, TX, USA, .
Kao, Wei lun ; Iyer, Ravishankar K. / DEFINE : a distributed fault injection and monitoring environment. Paper presented at Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems, Galveston, TX, USA, .8 p.
@conference{d1b44be9356f42708b8cd29d569389ac,
title = "DEFINE: a distributed fault injection and monitoring environment",
abstract = "This paper presents a distributed fault injection and monitoring environment (DEFINE) as a tool to evaluate system dependability, to investigate fault propagation, and to validate fault-tolerant mechanisms. DEFINE can inject both hardware faults (hardware-induced software errors) and software faults into any process running in a distributed system, either in user mode or in supervisor mode, and monitor the fault impact and propagation in software systems and among machines. It employs two fault injection techniques: (i) using hardware clock interrupts to control the time of fault injection and activation, and (ii) using software traps to inject all the faults except communication faults and memory faults in the data/stack segment. Experiments on six Sun SPARCstations to study the system behavior under faults are conducted to demonstrate the application of DEFINE.",
author = "Kao, {Wei lun} and Iyer, {Ravishankar K.}",
year = "1995",
month = "12",
day = "1",
language = "English (US)",
pages = "252--259",
note = "Proceedings of the 1995 Fault-Tolerant Parallel and Distributed Systems ; Conference date: 13-06-1994 Through 14-06-1994",

}

TY - CONF

T1 - DEFINE

T2 - a distributed fault injection and monitoring environment

AU - Kao, Wei lun

AU - Iyer, Ravishankar K.

PY - 1995/12/1

Y1 - 1995/12/1

N2 - This paper presents a distributed fault injection and monitoring environment (DEFINE) as a tool to evaluate system dependability, to investigate fault propagation, and to validate fault-tolerant mechanisms. DEFINE can inject both hardware faults (hardware-induced software errors) and software faults into any process running in a distributed system, either in user mode or in supervisor mode, and monitor the fault impact and propagation in software systems and among machines. It employs two fault injection techniques: (i) using hardware clock interrupts to control the time of fault injection and activation, and (ii) using software traps to inject all the faults except communication faults and memory faults in the data/stack segment. Experiments on six Sun SPARCstations to study the system behavior under faults are conducted to demonstrate the application of DEFINE.

AB - This paper presents a distributed fault injection and monitoring environment (DEFINE) as a tool to evaluate system dependability, to investigate fault propagation, and to validate fault-tolerant mechanisms. DEFINE can inject both hardware faults (hardware-induced software errors) and software faults into any process running in a distributed system, either in user mode or in supervisor mode, and monitor the fault impact and propagation in software systems and among machines. It employs two fault injection techniques: (i) using hardware clock interrupts to control the time of fault injection and activation, and (ii) using software traps to inject all the faults except communication faults and memory faults in the data/stack segment. Experiments on six Sun SPARCstations to study the system behavior under faults are conducted to demonstrate the application of DEFINE.

UR - http://www.scopus.com/inward/record.url?scp=0029520823&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0029520823&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0029520823

SP - 252

EP - 259

ER -