HARDWARE-RELATED SOFTWARE ERRORS: MEASUREMENT AND ANALYSIS.

Ravishankar K Iyer, Paola Velardi

Research output: Contribution to journalArticle

Abstract

This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.

Original languageEnglish (US)
Pages (from-to)223-231
Number of pages9
JournalIEEE Transactions on Software Engineering
VolumeSE-11
Issue number2
DOIs
StatePublished - Jan 1 1985
Externally publishedYes

Fingerprint

Measurement errors
Hardware
Recovery

ASJC Scopus subject areas

  • Software

Cite this

HARDWARE-RELATED SOFTWARE ERRORS : MEASUREMENT AND ANALYSIS. / Iyer, Ravishankar K; Velardi, Paola.

In: IEEE Transactions on Software Engineering, Vol. SE-11, No. 2, 01.01.1985, p. 223-231.

Research output: Contribution to journalArticle

@article{9eaf9c66ab124e0296e860bba23b9139,
title = "HARDWARE-RELATED SOFTWARE ERRORS: MEASUREMENT AND ANALYSIS.",
abstract = "This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.",
author = "Iyer, {Ravishankar K} and Paola Velardi",
year = "1985",
month = "1",
day = "1",
doi = "10.1109/TSE.1985.232198",
language = "English (US)",
volume = "SE-11",
pages = "223--231",
journal = "IEEE Transactions on Software Engineering",
issn = "0098-5589",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "2",

}

TY - JOUR

T1 - HARDWARE-RELATED SOFTWARE ERRORS

T2 - MEASUREMENT AND ANALYSIS.

AU - Iyer, Ravishankar K

AU - Velardi, Paola

PY - 1985/1/1

Y1 - 1985/1/1

N2 - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.

AB - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.

UR - http://www.scopus.com/inward/record.url?scp=0022013529&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0022013529&partnerID=8YFLogxK

U2 - 10.1109/TSE.1985.232198

DO - 10.1109/TSE.1985.232198

M3 - Article

AN - SCOPUS:0022013529

VL - SE-11

SP - 223

EP - 231

JO - IEEE Transactions on Software Engineering

JF - IEEE Transactions on Software Engineering

SN - 0098-5589

IS - 2

ER -