TY - JOUR
T1 - HARDWARE-RELATED SOFTWARE ERRORS
T2 - MEASUREMENT AND ANALYSIS.
AU - Iyer, Ravishankar K.
AU - Velardi, Paola
N1 - Funding Information:
Manuscript received December 19, 2008; revised March 02, 2009. First published March 24, 2009; current version published August 26, 2009. This work was supported by National Institute of Biomedical Imaging and Bioengineering under Grant RO1 EB002798. This work was reported in part at the 2006 and 2007 IEEE Medical Imaging Conferences and in the accompanying proceedings. The contents are solely the responsibility of the authors and do not necessarily represent the official views of the National Institutes of Health. Asterisk indicates corresponding author. *A. Lehovich is with the Division of Nuclear Medicine, Department of Radiology, University of Massachusetts Medical School, Worcester, MA 01655 USA (e-mail: [email protected]).
PY - 1985
Y1 - 1985
N2 - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.
AB - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.
UR - http://www.scopus.com/inward/record.url?scp=0022013529&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0022013529&partnerID=8YFLogxK
U2 - 10.1109/TSE.1985.232198
DO - 10.1109/TSE.1985.232198
M3 - Article
AN - SCOPUS:0022013529
SN - 0098-5589
VL - SE-11
SP - 223
EP - 231
JO - IEEE Transactions on Software Engineering
JF - IEEE Transactions on Software Engineering
IS - 2
ER -