• DocumentCode
    1153175
  • Title

    Adaptive divisible load scheduling strategies for workstation clusters with unknown network resources

  • Author

    Ghose, Debasish ; Kim, Hyoung Joong ; Kim, Tae Hoon

  • Author_Institution
    Dept. of Aerosp. Eng., Indian Inst. of Sci., Bangalore, India
  • Volume
    16
  • Issue
    10
  • fYear
    2005
  • Firstpage
    897
  • Lastpage
    907
  • Abstract
    Conventional divisible load scheduling algorithms attempt to achieve optimal partitioning of massive loads to be distributed among processors in a distributed computing system in the presence of communication delays in the network. However, these algorithms depend strongly upon the assumption of prior knowledge of network parameters and cannot handle variations or lack of information about these parameters. In this paper, we present an adaptive strategy that estimates network parameter values using a probing technique and use them to obtain optimal load partitioning. Three algorithms, based on the same strategy, are presented in the paper, incorporating the ability to cope with unknown network parameters. Several illustrative numerical examples are given. Finally, we implement the adaptive algorithms on an actual network of processor nodes using MPI implementation and demonstrate the feasibility of the adaptive approach.
  • Keywords
    delays; message passing; multiprocessing systems; processor scheduling; resource allocation; workstation clusters; MPI; adaptive divisible load scheduling; communication delay; distributed computing system; multiprocessor system; network resources; optimal load partitioning; probing technique; task partitioning; workstation cluster; Adaptive scheduling; Delay effects; Distributed computing; Distribution strategy; Load modeling; Partitioning algorithms; Performance analysis; Processor scheduling; Scheduling algorithm; Workstations; Scheduling and task partitioning; distributed applications; divisible loads; multiprocessor systems.; workstations;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2005.117
  • Filename
    1501802