DocumentCode
3103941
Title
Evaluation of strategies to reduce the impact of machine reclaim in cycle-stealing environments
Author
Heymann, E. ; Senar, M.A. ; Luque, E. ; Livny, M.
Author_Institution
Unitat d´´Arquitectura d´´Ordinadors i Sistemes Oper., Univ. Autonoma de Barcelona, Spain
fYear
2001
fDate
2001
Firstpage
320
Lastpage
328
Abstract
We investigate the scheduling problem that arises in parallel applications executing on a network of machines by using a mode of cycle-stealing. In this mode of execution a parallel application executes its tasks in several machines whenever they are idle. When the user reclaims the machine, tasks must relinquish control immediately. In this case, the parallel application has the risk of losing work in progress on reclaimed machines and, therefore, the total execution time of the parallel application will be affected by the need for rescheduling the pre-empted task. We first evaluate the impact on the performance of an application when it runs on two different scenarios: a set of N dedicated machines, and a set of N non-dedicated machines (in which pre-emption may occur). This study shows that losing machines may have a considerable impact on the execution time of the application and therefore, we propose and evaluate three simple strategies to alleviate this problem. All strategies are based on the use of additional machines, but they differ in the way that these extra machines are used. In the first strategy additional machines are added to the common pool of machines used by the application. The other two are based on task replication, in which the additional machines are used to execute certain tasks that are already running in other machines
Keywords
resource allocation; scheduling; software performance evaluation; workstation clusters; application performance; cluster computing; cycle-stealing environments; dedicated machines; execution time; grid computing; machine reclaim; parallel applications; scheduling; task replication; Contracts; Cultural differences; Grid computing; High-speed networks; Intelligent networks; Middleware; Personal communication networks; Processor scheduling; Resource management; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
Conference_Location
Brisbane, Qld.
Print_ISBN
0-7695-1010-8
Type
conf
DOI
10.1109/CCGRID.2001.923209
Filename
923209
Link To Document