Title :
Simulating and Detecting Radiation-Induced Errors for Onboard Machine Learning
Author :
Granat, Robert ; Tang, Benyang ; Bornstein, Benjamin ; Wagstaff, Kiri L. ; Turmon, Michael
Author_Institution :
Jet Propulsion Lab., California Inst. of Technol., Pasadena, CA, USA
Abstract :
Spacecraft processors and memory are subjected to high radiation doses and therefore employ radiation-hardened components. However, these components are orders of magnitude more expensive than typical desktop components, and they lag years behind in terms of speed and size. We have integrated algorithm-based fault tolerance (ABFT) methods into onboard data analysis algorithms to detect radiation-induced errors, which ultimately may permit the use of spacecraft memory that need not be fully hardened, reducing cost and increasing capability at the same time. We have also developed a lightweight software radiation simulator, BITFLIPS, that permits evaluation of error detection strategies in a controlled fashion, including the specification of the radiation rate and selective exposure of individual data structures. Using BITFLIPS, we evaluated our error detection methods when using a support vector machine to analyze data collected by the Mars Odyssey spacecraft. We observed good performance from both an existing ABFT method for matrix multiplication and a novel ABFT method for exponentiation. These techniques bring us a step closer to "rad-hard" machine learning algorithms.
Keywords :
data analysis; data structures; fault tolerant computing; learning (artificial intelligence); matrix algebra; military aircraft; support vector machines; Mars Odyssey spacecraft; desktop components; individual data structures; integrated algorithm-based fault tolerance methods; matrix multiplication; onboard data analysis algorithms; onboard machine learning; radiation-hardened components; radiation-induced errors detection; spacecraft memory; spacecraft processors; support vector machine; Costs; Data analysis; Error correction; Fault detection; Fault tolerance; Machine learning; Machine learning algorithms; Radiation detectors; Radiation hardening; Space vehicles; algorithm-based fault tolerance; onboard data analysis; radiation protection; support vector machines;
Conference_Titel :
Space Mission Challenges for Information Technology, 2009. SMC-IT 2009. Third IEEE International Conference on
Conference_Location :
Pasadena, CA
Print_ISBN :
978-0-7695-3637-8
DOI :
10.1109/SMC-IT.2009.22