DocumentCode
245405
Title
Soft Error Resiliency Characterization on IBM BlueGene/Q Processor
Author
Chen-Yong Cher ; Muller, K. Paul ; Haring, Ruud A. ; Satterfield, David L. ; Musta, Thomas E. ; Gooding, Thomas M. ; Davis, Kristan D. ; Dombrowa, Marc B. ; Kopcsay, Gerard V. ; Senger, Robert M. ; Sugawara, Yoko ; Sugavanam, Krishnan
fYear
2014
fDate
20-23 Jan. 2014
Firstpage
385
Lastpage
387
Abstract
Soft Error Resiliency (SER) is a major concern for Petascale high performance computing (HPC) systems. In designing Blue Gene/Q (BG/Q) [8], many mechanisms were deployed to target SER including extensive use of Silicon-On-Insulator (SOI), radiation-hardened latches [7,13], detection and correction in on-chip arrays, and very low radiation packaging materials. On the other hand, it is well known that application behavior has major impacts on the masking (or “derating” factor) in system SER calculations. The principal goal of this project is to understand the interaction between BG/Q hardware and high-performance applications when it comes to SER by performing and evaluating a chip irradiation experiment.
Keywords
fault tolerant computing; microprocessor chips; multiprocessing systems; parallel processing; HPC systems; IBM BlueGene processor; IBM Q processor; SER calculations; SER characterization; SOI; application behavior; chip irradiation experiment; derating factor; masking factor; on-chip arrays; petascale high performance computing systems; radiation-hardened latches; silicon-on-insulator; soft error resiliency characterization; very low radiation packaging materials; Benchmark testing; Circuit faults; Hardware; Latches; Neutrons; Packaging; Radiation effects;
fLanguage
English
Publisher
ieee
Conference_Titel
Design Automation Conference (ASP-DAC), 2014 19th Asia and South Pacific
Conference_Location
Singapore
Type
conf
DOI
10.1109/ASPDAC.2014.6742920
Filename
6742920
Link To Document