TY - GEN
T1 - Demonstration of the remote exploration and experimentation (REE) fault-tolerant parallel-processing supercomputer for spacecraft onborad scientific data processing
AU - Chen, Fannie
AU - Craymer, Loring
AU - Deifik, Jeff
AU - Fogel, Alvin J.
AU - Katz, Daniel S.
AU - Silliman, Alfred G.
AU - Some, Raphael R.
AU - Upchurch, Sean A.
AU - Whisnant, Keith
PY - 2000
Y1 - 2000
N2 - This paper is the written explanation for a demonstration of the REE Project's work to-date. The demonstration is intended to simulate an REE system that might exist on a Mars Rover, consisting of multiple COTS processors, a COTS network, a COTS node-level operating system, REE middleware, and an REE application. The specific application performs texture processing of images. It was chosen as a building block of automated geological processing that will eventually be used for both navigation and data processing. Because the COTS hardware is not radiation hardened, SEU-induced soft errors will occur. These errors are simulated in the demonstration by use of a software-implemented fault-injector, and are injected at a rate much higher than is realistic for the sake of viewer interest. Both the application and the middleware contain mechanisms for both detection of and recovery from these faults, and these mechanisms are tested by this very high fault-rate. The consequence of the REE system being able to tolerate this fault rate while continuing to process data is that the system will easily be able to handle the true fault rate.
AB - This paper is the written explanation for a demonstration of the REE Project's work to-date. The demonstration is intended to simulate an REE system that might exist on a Mars Rover, consisting of multiple COTS processors, a COTS network, a COTS node-level operating system, REE middleware, and an REE application. The specific application performs texture processing of images. It was chosen as a building block of automated geological processing that will eventually be used for both navigation and data processing. Because the COTS hardware is not radiation hardened, SEU-induced soft errors will occur. These errors are simulated in the demonstration by use of a software-implemented fault-injector, and are injected at a rate much higher than is realistic for the sake of viewer interest. Both the application and the middleware contain mechanisms for both detection of and recovery from these faults, and these mechanisms are tested by this very high fault-rate. The consequence of the REE system being able to tolerate this fault rate while continuing to process data is that the system will easily be able to handle the true fault rate.
UR - http://www.scopus.com/inward/record.url?scp=0034590460&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034590460&partnerID=8YFLogxK
U2 - 10.1109/ICDSN.2000.857562
DO - 10.1109/ICDSN.2000.857562
M3 - Conference contribution
AN - SCOPUS:0034590460
SN - 0769507085
SN - 9780769507088
T3 - Proceedings of the 2002 International Conference on Dependable Systems and Networks
SP - 367
EP - 372
BT - Proceedings of the 2002 International Conference on Dependable Systems and Networks
T2 - Proceedings of the International Conference on Dependable Systems and Networks
Y2 - 1 July 2001 through 4 July 2001
ER -