• DocumentCode
    446502
  • Title

    Empirical evaluation of shared parallel execution on independently scheduled clusters

  • Author

    Ghanesh, Mala ; Kumar, Sathish ; Subhlok, Jaspal

  • Author_Institution
    Dept. of Comput. Sci., Houston Univ., TX, USA
  • Volume
    1
  • fYear
    2005
  • fDate
    9-12 May 2005
  • Firstpage
    309
  • Abstract
    Parallel machines are typically space shared, or time shared such that only one application executes on a group of nodes at any given time. It is generally assumed that executing multiple parallel applications simultaneously on a group of independently scheduled nodes is not efficient because of synchronization requirements. The central contribution of this paper is to demonstrate that performance of parallel applications with sharing is typically competitive for independent and coordinated (gang) scheduling on small compute clusters. There is a modest overhead due to uncoordinated scheduling but it is often compensated by better sharing of resources. The impact of sharing was studied for different numbers of nodes and threads and different memory and CPU requirements of competing applications. The significance of the CPU time slice, a key parameter in CPU scheduling, was also studied. Application characteristics and operating system scheduling policies are identified as the main factors that influence performance with node sharing. All experiments are performed with NAS benchmarks on a Linux cluster. The significance of this research is that it provides evidence to support flexible and decentralized scheduling and resource selection policies for cluster and grid environments.
  • Keywords
    grid computing; processor scheduling; workstation clusters; CPU scheduling; CPU time slice; Linux cluster; NAS benchmarks; compute clusters; coordinated gang scheduling; decentralized scheduling; grid environments; independent scheduling; independently scheduled clusters; key parameter; multiple parallel applications; node sharing; operating system scheduling; parallel machines; shared parallel execution; synchronization requirements; Application software; Communication system control; Computer science; Grid computing; High performance computing; Operating systems; Parallel machines; Processor scheduling; Switches; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2005. CCGrid 2005. IEEE International Symposium on
  • Print_ISBN
    0-7803-9074-1
  • Type

    conf

  • DOI
    10.1109/CCGRID.2005.1558570
  • Filename
    1558570