Software Fault Tolerance for Cyber-Physical Systems via Full System Restart

Pushpak Jagtap, Fardin Abdi, Matthias Rungger, Majid Zamani, Marco Caccamo

Research output: Contribution to journalArticlepeer-review

Abstract

The article addresses the issue of reliability of complex embedded control systems in the safety-critical environment. In this article, we propose a novel approach to design controller that (i) guarantees the safety of nonlinear physical systems, (ii) enables safe system restart during runtime, and (iii) allows the use of complex, unverified controllers (e.g., neural networks) that drive the physical systems toward complex specifications. We use abstraction-based controller synthesis approach to design a formally verified controller that provides application and system-level fault tolerance along with safety guarantee. Moreover, our approach is implementable using a commercial-off-the-shelf (COTS) processing unit. To demonstrate the efficacy of our solution and to verify the safety of the system under various types of faults injected in applications and in the underlying real-time operating system (RTOS), we implemented the proposed controller for the inverted pendulum and three degrees-of-freedom (3-DOF) helicopter.

Original languageEnglish (US)
Article number47
JournalACM Transactions on Cyber-Physical Systems
Volume4
Issue number4
DOIs
StatePublished - Aug 2020

Keywords

  • Cyber-physical systems
  • abstraction-based control
  • fault-tolerance
  • full system restart
  • nonlinear systems

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Hardware and Architecture
  • Human-Computer Interaction
  • Control and Optimization

Fingerprint Dive into the research topics of 'Software Fault Tolerance for Cyber-Physical Systems via Full System Restart'. Together they form a unique fingerprint.

Cite this