DocumentCode
3272981
Title
Cost Reduction in High Power Computing Using a Deferred Repair Strategy: A Simulation Study
Author
Koçyigit, Altan ; Gemikonakli, Orhan ; Ever, Enver
fYear
2008
fDate
1-3 April 2008
Firstpage
354
Lastpage
359
Abstract
Fault-tolerant systems with repair-upon-failure strategy can become expensive in terms of labour and time. Especially for homogeneous multi-server systems, if no control hierarchy exists, postponing non essential repairs can reduce these costs without affecting the availability of the whole system significantly. Of course, while postponing these repairs, it is essential to keep the whole system capable to deal with user requests. For this purpose, usually, a threshold value is defined which represents the minimum number of servers the system administrator should keep operative. Performability evaluation of such systems is very important since the systems are fault tolerant. In this paper, the simulation of large scale multi-server systems, with identical servers, serving a stream of arriving jobs is considered. The cost of running such systems with various deferred repair strategies has been calculated and compared to the cost of using a repair-upon failure strategy.
Keywords
Availability; Computational modeling; Computer simulation; Control systems; Cost function; Degradation; Fault tolerant systems; Monte Carlo methods; Performance evaluation; Power system modeling; Deferred repairs; mutiprocessor systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Modeling and Simulation, 2008. UKSIM 2008. Tenth International Conference on
Conference_Location
Cambridge, UK
Print_ISBN
0-7695-3114-8
Type
conf
DOI
10.1109/UKSIM.2008.122
Filename
4488957
Link To Document