TY - JOUR
T1 - HARDWARE-RELATED SOFTWARE ERRORS
T2 - MEASUREMENT AND ANALYSIS.
AU - Iyer, Ravishankar K.
AU - Velardi, Paola
PY - 1985/1/1
Y1 - 1985/1/1
N2 - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.
AB - This paper describes an analysis of hardware-related software (HW/SW) errors on an MVS/SP operating system. The analysis procedure demonstrates a methodology for evaluating the interaction between hardware and software as it relates to system reliability. The paper examines the operating system's handling of HW/SW errors and also the effectiveness of recovery management. Nearly 35 percent of all observed software failures were found to be hardware-related. The analysis shows that the operating system is seldom able to diagnose that a software error may be hardware-related. The impact of HW/SW errors on the system is evaluated by measuring the effectiveness of system recovery in containing the propagation of HW/SW errors. The system failure probability for HW/SW errors is close to three times that for software errors in general. The observed HW/SW errors are seen to have a specific pattern, suggesting the possibility of the use of such error patterns for intelligent error prediction and recovery.
UR - http://www.scopus.com/inward/record.url?scp=0022013529&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0022013529&partnerID=8YFLogxK
U2 - 10.1109/TSE.1985.232198
DO - 10.1109/TSE.1985.232198
M3 - Article
AN - SCOPUS:0022013529
VL - SE-11
SP - 223
EP - 231
JO - IEEE Transactions on Software Engineering
JF - IEEE Transactions on Software Engineering
SN - 0098-5589
IS - 2
ER -