DocumentCode :
3056560
Title :
Performability Models for Multi-Server Systems with High-Variance Repair Durations
Author :
Schwefel, Hans-Peter ; Antonios, Imad
Author_Institution :
Aalborg Univ., Aalborg
fYear :
2007
fDate :
25-28 June 2007
Firstpage :
770
Lastpage :
779
Abstract :
We consider cluster systems with multiple nodes where each server is prone to run tasks at a degraded level of service due to some software or hardware fault. The cluster serves tasks generated by remote clients, which are potentially queued at a dispatcher. We present an analytic queueing model of such systems, represented as an M/MMPP/1 queue, and derive and analyze exact numerical solutions for the mean and tail-probabilities of the queue-length distribution. The analysis shows that the distribution of the repair time is critical for these performability metrics. Additionally, in the case of high-variance repair times, the model reveals so-called blow-up points, at which the performance characteristics change dramatically. Since this blowup behavior is sensitive to a change in model parameters, it is critical for system designers to be aware of the conditions under which it occurs. Finally, we present simulation results that demonstrate the robustness of this qualitative blow-up behavior towards several model variations.
Keywords :
multiprocessing systems; workstation clusters; analytic queueing model; cluster systems; high-variance repair durations; multiserver systems; remote clients; repair time distribution; Analytical models; Computer crashes; Computer science; Degradation; Electric breakdown; Failure analysis; Hardware; Queueing analysis; Robustness; Software performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks, 2007. DSN '07. 37th Annual IEEE/IFIP International Conference on
Conference_Location :
Edinburgh
Print_ISBN :
0-7695-2855-4
Type :
conf
DOI :
10.1109/DSN.2007.73
Filename :
4273028
Link To Document :
بازگشت