Demonstration of the remote exploration and experimentation (REE) fault-tolerant parallel-processing supercomputer for spacecraft onborad scientific data processing

Fannie Chen, Loring Craymer, Jeff Deifik, Alvin J. Fogel, Daniel S. Katz, Alfred G. Silliman, Raphael R. Some, Sean A. Upchurch, Keith Whisnant

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper is the written explanation for a demonstration of the REE Project's work to-date. The demonstration is intended to simulate an REE system that might exist on a Mars Rover, consisting of multiple COTS processors, a COTS network, a COTS node-level operating system, REE middleware, and an REE application. The specific application performs texture processing of images. It was chosen as a building block of automated geological processing that will eventually be used for both navigation and data processing. Because the COTS hardware is not radiation hardened, SEU-induced soft errors will occur. These errors are simulated in the demonstration by use of a software-implemented fault-injector, and are injected at a rate much higher than is realistic for the sake of viewer interest. Both the application and the middleware contain mechanisms for both detection of and recovery from these faults, and these mechanisms are tested by this very high fault-rate. The consequence of the REE system being able to tolerate this fault rate while continuing to process data is that the system will easily be able to handle the true fault rate.

Original languageEnglish (US)
Title of host publicationProceedings of the 2002 International Conference on Dependable Systems and Networks
Pages367-372
Number of pages6
DOIs
StatePublished - 2000
Externally publishedYes
EventProceedings of the International Conference on Dependable Systems and Networks - New York, NY, United States
Duration: Jul 1 2001Jul 4 2001

Publication series

NameProceedings of the 2002 International Conference on Dependable Systems and Networks

Other

OtherProceedings of the International Conference on Dependable Systems and Networks
Country/TerritoryUnited States
CityNew York, NY
Period7/1/017/4/01

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'Demonstration of the remote exploration and experimentation (REE) fault-tolerant parallel-processing supercomputer for spacecraft onborad scientific data processing'. Together they form a unique fingerprint.

Cite this