Title :
Integrating Computing Resources on Multiple Grid-enabled Job Scheduling Systems Through a Grid RPC System
Author :
Nakajima, Yoshihiro ; Sato, Mitsuhisa ; Aida, Yoshiaki ; Boku, Taisuke ; Cappello, Franck
Author_Institution :
University of Tsukuba, Japan
Abstract :
We present a framework for a parallel programming model by remote procedure calls bridging between largescale computing resource pools managed by multiple gridenabled job scheduling systems. With this system, the user can exploit not only each remote servers and clusters, but also computing resources provided with grid-enabled job scheduling systems located on different sites. This framework requires a Grid RPC system to decouple the computation in a remote node from the Grid RPC mechanism and uses document-based communication rather than connection-based communication. We implemented the proposed framework as an extension of the OmniRPC system, which is a Grid RPC system for parallel programming in a grid environment. We designed a general interface to adapt the OmniRPC system to various grid-enabled job scheduling systems easily and applied the proposed system to several grid-enabled job scheduling systems, including XtremWeb, CyberGRIP, Condor and Grid Engine. we show the preliminary performance of these implementations using a phylogenetic application. We found that the proposed system can achieve approximately the same performance as using OmniRPC and can handle interruptions in worker programs on remote nodes.
Keywords :
Computer networks; Concurrent computing; Distributed computing; Grid computing; Job shop scheduling; Parallel programming; Pervasive computing; Processor scheduling; Resource management; Systems engineering and theory;
Conference_Titel :
Cluster Computing and the Grid, 2006. CCGRID 06. Sixth IEEE International Symposium on
Conference_Location :
Singapore
Print_ISBN :
0-7695-2585-7
DOI :
10.1109/CCGRID.2006.60