TY - GEN
T1 - Restart-based fault-tolerance
T2 - 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2017
AU - Abdi, Fardin
AU - Mancuso, Renato
AU - Tabish, Rohan
AU - Caccamo, Marco
N1 - Funding Information:
The material presented in this paper is based upon work supported by the National Science Foundation (NSF) under grant numbers CNS-1302563 and CNS-1646383. Any opinions, findings, and conclusions or recommendations expressed in this publication are those of the authors and do not necessarily reflect the views of the NSF and other sponsors.
Publisher Copyright:
© 2017 IEEE.
PY - 2017/9/19
Y1 - 2017/9/19
N2 - Embedded systems in safety-critical environments are continuously required to deliver more performance and functionality, while expected to provide verified safety guarantees. Nonetheless, platform-wide software verification (required for safety) is often expensive. Therefore, design methods that enable utilization of components such as real-time operating systems (RTOS), without requiring their correctness to guarantee safety, is necessary. In this paper, we propose a design approach to deploy safe-by-design embedded systems. To attain this goal, we rely on a small core of verified software to handle faults in applications and RTOS and recover from them while ensuring that timing constraints of safety-critical tasks are always satisfied. Faults are detected by monitoring the application timing and fault-recovery is achieved via full platform restart and software reload, enabled by the short restart time of embedded systems. Schedulability analysis is used to ensure that the timing constraints of critical plant control tasks are always satisfied in spite of faults and consequent restarts. We derive schedulability results for four restart-tolerant task models. We use a simulator to evaluate and compare the performance of the considered scheduling models.
AB - Embedded systems in safety-critical environments are continuously required to deliver more performance and functionality, while expected to provide verified safety guarantees. Nonetheless, platform-wide software verification (required for safety) is often expensive. Therefore, design methods that enable utilization of components such as real-time operating systems (RTOS), without requiring their correctness to guarantee safety, is necessary. In this paper, we propose a design approach to deploy safe-by-design embedded systems. To attain this goal, we rely on a small core of verified software to handle faults in applications and RTOS and recover from them while ensuring that timing constraints of safety-critical tasks are always satisfied. Faults are detected by monitoring the application timing and fault-recovery is achieved via full platform restart and software reload, enabled by the short restart time of embedded systems. Schedulability analysis is used to ensure that the timing constraints of critical plant control tasks are always satisfied in spite of faults and consequent restarts. We derive schedulability results for four restart-tolerant task models. We use a simulator to evaluate and compare the performance of the considered scheduling models.
UR - http://www.scopus.com/inward/record.url?scp=85032720048&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032720048&partnerID=8YFLogxK
U2 - 10.1109/RTCSA.2017.8046320
DO - 10.1109/RTCSA.2017.8046320
M3 - Conference contribution
AN - SCOPUS:85032720048
T3 - RTCSA 2017 - 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications
BT - RTCSA 2017 - 23rd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 16 August 2017 through 18 August 2017
ER -