Title :
Grid aware HA-OSCAR
Author :
Limaye, Kshitij ; Leangsuksun, Box ; Munganuru, Venkata K. ; Greenwood, Zeno ; Scott, Stephen L. ; Libby, Richard ; Chanchio, Kaisdit
Author_Institution :
Louisiana Tech. Univ., Ruston, LA, USA
Abstract :
Physicists today have employed grid technology to overcome various resource level hurdles. The collective resource utilization achieved through grid computing is critical to the overall computing capacity of the community and should be guaranteed. In an environment where job sites are cluster systems, a service node failure renders a whole system outage. Our grid-aware HA-OSCAR effort was motivated by the popularity of the cluster architecture in the grid environment. We propose the high-availability architecture, HA-OSCAR, for cluster-based job sites in the grid environment. This architecture deals with fault tolerance at the service level complementing task-based solutions such as checkpoint/restart. We discuss various service availability issues related to the grid, some issues and preliminary results obtained while implementing the smart failover feature and the automated grid installation package. Our report entails the performance benefits achieved after applying the HA-OSCAR solution to the cluster components of the grid compared to regular Beowulf style cluster solutions.
Keywords :
computer network management; fault tolerant computing; grid computing; public domain software; software packages; workstation clusters; Open Source Cluster Application Resource; automated grid installation package; cluster-based job site; fault tolerance; grid technology; grid-aware HA-OSCAR; high-availability architecture; service availability; task-level failure handling; Availability; Collaboration; Computer architecture; Distributed computing; Fault tolerance; Grid computing; Laboratories; Packaging; Resource management; US Department of Energy;
Conference_Titel :
High Performance Computing Systems and Applications, 2005. HPCS 2005. 19th International Symposium on
Print_ISBN :
0-7695-2343-9
DOI :
10.1109/HPCS.2005.28