Title :
High throughput grid computing with an IBM Blue Gene/L
Author :
Cope, Jason ; Oberg, Michael ; Tufo, Henry M. ; Voran, Theron ; Woitaszek, Matthew
Author_Institution :
Dept. of Comput. Sci., Univ. of Colorado, Boulder, CO
Abstract :
While much high-performance computing is performed using massively parallel MPI applications, many workflows execute jobs with a mix of processor counts. At the extreme end of the scale, some workloads consist of large quantities of single-processor jobs. These types of workflows lead to inefficient usage of massively parallel architectures such as the IBM Blue Gene/L (BG/L) because of allocation constraints forced by its unique system design. Recently, IBM introduced the ability to schedule individual processors on BG/L - a feature named high throughput computing (HTC) - creating an opportunity to exploit the systempsilas power efficiency for other classes of computing. In this paper, we present a Grid-enabled interface supporting HTC on BG/L. This interface accepts single-processor tasks using Globus GRAM, aggregates HTC tasks into BG/L partitions, and requests partition execution using the underlying system scheduler. By separating HTC task aggregation from scheduling, we provide the ability for workflows constructed using standard Grid middleware to run both parallel and serial jobs on the BG/L. We examine the startup latency and performance of running large quantities of HTC jobs. Finally, we deploy Daymet, a component of a coupled climate model, on a BG/L system using our HTC interface.
Keywords :
grid computing; middleware; processor scheduling; Daymet; IBM Blue Gene/L; coupled climate model; globus GRAM; grid-enabled interface supporting; high throughput grid computing; individual processors schedule; massively parallel MPI; partition execution; single-processor jobs; single-processor tasks; standard Grid middleware; startup latency; system scheduler; Aggregates; Application software; Computer architecture; Computer science; Grid computing; Power system interconnection; Power system modeling; Processor scheduling; Scalability; Throughput;
Conference_Titel :
Cluster Computing, 2007 IEEE International Conference on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-1387-4
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2007.4629250