• DocumentCode
    244489
  • Title

    Distributed scheduling and data sharing in late-binding overlays

  • Author

    Delgado Peris, Antonio ; Hernandez, Jose M. ; Huedo, Eduardo

  • Author_Institution
    Sci. Comput. Unit, CIEMAT, Madrid, Spain
  • fYear
    2014
  • fDate
    21-25 July 2014
  • Firstpage
    129
  • Lastpage
    136
  • Abstract
    Pull-based late-binding overlays are used in some of today's largest computational grids. Job agents are submitted to resources with the duty of retrieving real workload from a central queue at runtime. This helps overcome the problems of these complex environments: heterogeneity, imprecise status information and relatively high failure rates. In addition, the late job assignment allows dynamic adaptation to changes in grid conditions or user priorities. However, as the scale grows, the central assignment queue may become a bottleneck for the whole system. This article presents a distributed scheduling architecture for late-binding overlays, which addresses this issue by letting execution nodes build a distributed hash table and delegating job matching and assignment to them. This reduces the load on the central server and makes the system much more scalable and robust. Scalability makes fine-grained scheduling possible and enables new functionalities, like the implementation of a distributed data cache on the execution nodes, which helps alleviate the commonly congested grid storage services.
  • Keywords
    file organisation; grid computing; peer-to-peer computing; central assignment queue; data sharing; distributed data cache; distributed hash table; distributed scheduling; grid storage service; job agents; job matching; late job assignment; pull-based late-binding overlays; Distributed algorithms; Measurement; Peer-to-peer computing; Grid and Cluster Computing; Peer-to-Peer Architectures and Networks; Reliable Parallel and Distributed Algorithms; Scalable Computing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2014 International Conference on
  • Conference_Location
    Bologna
  • Print_ISBN
    978-1-4799-5312-7
  • Type

    conf

  • DOI
    10.1109/HPCSim.2014.6903678
  • Filename
    6903678