• DocumentCode
    3103941
  • Title

    Evaluation of strategies to reduce the impact of machine reclaim in cycle-stealing environments

  • Author

    Heymann, E. ; Senar, M.A. ; Luque, E. ; Livny, M.

  • Author_Institution
    Unitat d´´Arquitectura d´´Ordinadors i Sistemes Oper., Univ. Autonoma de Barcelona, Spain
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    320
  • Lastpage
    328
  • Abstract
    We investigate the scheduling problem that arises in parallel applications executing on a network of machines by using a mode of cycle-stealing. In this mode of execution a parallel application executes its tasks in several machines whenever they are idle. When the user reclaims the machine, tasks must relinquish control immediately. In this case, the parallel application has the risk of losing work in progress on reclaimed machines and, therefore, the total execution time of the parallel application will be affected by the need for rescheduling the pre-empted task. We first evaluate the impact on the performance of an application when it runs on two different scenarios: a set of N dedicated machines, and a set of N non-dedicated machines (in which pre-emption may occur). This study shows that losing machines may have a considerable impact on the execution time of the application and therefore, we propose and evaluate three simple strategies to alleviate this problem. All strategies are based on the use of additional machines, but they differ in the way that these extra machines are used. In the first strategy additional machines are added to the common pool of machines used by the application. The other two are based on task replication, in which the additional machines are used to execute certain tasks that are already running in other machines
  • Keywords
    resource allocation; scheduling; software performance evaluation; workstation clusters; application performance; cluster computing; cycle-stealing environments; dedicated machines; execution time; grid computing; machine reclaim; parallel applications; scheduling; task replication; Contracts; Cultural differences; Grid computing; High-speed networks; Intelligent networks; Middleware; Personal communication networks; Processor scheduling; Resource management; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2001. Proceedings. First IEEE/ACM International Symposium on
  • Conference_Location
    Brisbane, Qld.
  • Print_ISBN
    0-7695-1010-8
  • Type

    conf

  • DOI
    10.1109/CCGRID.2001.923209
  • Filename
    923209