Title :
Error Behavior Comparison of Multiple Computing Systems: A Case Study Using Linux on Pentium, Solaris on SPARC, and AIX on POWER
Author :
Chen, Daniel ; Jacques-Silva, Gabriela ; Kalbarczyk, Zbigniew ; Iyer, Ravishankar K. ; Mealey, Bruce
Author_Institution :
Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
This paper presents an approach to conducting experimental studies for the characterization and comparison of the error behavior in different computing systems. The proposed approach is applied to characterize and compare the error behavior of three commercial systems (Linux 2.6 on Pentium 4, Solaris 10 on UltraSPARC IIIi, and AIX 5.3 on POWER 5) under hardware transient faults. The data is obtained by conducting extensive fault injection into kernel code, kernel stack, and system registers with the NFTAPE framework while running the Apache Web server as a workload. The error behavior comparison shows that the Linux system has the highest average crash latency, the Solaris system has the highest hang rate, and the AIX system has the lowest error sensitivity and the least amount of crashes in the more severe categories.
Keywords :
Internet; Linux; error handling; file servers; operating system kernels; program diagnostics; AIX; AIX 5.3; Apache Web server; Linux; Linux 2.6; NFTAPE framework; POWER 5; Pentium 4; Solaris; Solaris 10; UltraSPARC IIIi; error behavior; fault injection; hardware transient fault; kernel code; kernel stack; multiple computing system; system register; Computer crashes; Delay; Fault detection; Hardware; Kernel; Linux; Operating systems; Registers; Stress measurement; Web server; dependability analysis; fault injection; fault tolerance; operating systems;
Conference_Titel :
Dependable Computing, 2008. PRDC '08. 14th IEEE Pacific Rim International Symposium on
Conference_Location :
Taipei
Print_ISBN :
978-0-7695-3448-0
Electronic_ISBN :
978-0-7695-3448-0
DOI :
10.1109/PRDC.2008.35