Abstract

The paper describes the reliability microkernel framework (RMK), a loadable kernel module for providing application-aware reliability and dynamically configuring the reliability mechanisms installed in RMK. The RMK prototype is implemented in Linux and supports detection of application/OS failures and transparent application checkpointing. Experimental results show that the OS hang detection, which exploits characteristics of application and system behavior, can achieve high coverage (100% in our experiments) and a low false-positive rate. Moreover, the performance overhead is negligible because instruction counting is performed in hardware.

Original language: English (US)
Title of host publication: Proceedings - 12th Pacific Rim International Symposium on Dependable Computing, PRDC 2006
Pages: 55-62
Number of pages: 8
State: Published - 2006
Event: 12th Pacific Rim International Symposium on Dependable Computing, PRDC 2006 - Riverside, CA, United States
Duration: Dec 18 2006 to Dec 20 2006

Publication series

Name: Proceedings - 12th Pacific Rim International Symposium on Dependable Computing, PRDC 2006

Other

Other: 12th Pacific Rim International Symposium on Dependable Computing, PRDC 2006
Country/Territory: United States
City: Riverside, CA
Period: 12/18/06 to 12/20/06

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
