Title :
Transparent load sharing in distributed systems: decentralized design alternatives based on the Condor package
Author :
Hou, Chao-Ju ; Shin, Kang G. ; Tsukada, Thomas Kaeppel
Author_Institution :
Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
Abstract :
In recent years a number of load sharing (LS) mechanisms have been proposed or implemented to fully utilize system resources. We design and implement a decentralized LS mechanism based on the Condor package, and give in this paper a description of our design and implementation approaches. Two important features of the design are the use of region-change broadcasts in the information policy to provide each workstation with timely state information at the minimum communication cost, and the use of preferred list in the location policy to avoid task collisions. With these two features, we remove the central manager workstation in Condor, configure its functionalities into each participating workstation, and thus enhance the capability to tolerate single workstation failure and the reliability of Condor. We also discuss the experiments we conduct on the LS mechanism and the observations we obtained from empirical data
Keywords :
distributed processing; resource allocation; Condor package; decentralized design; distributed systems; load sharing; region-change broadcasts; task collisions; Availability; Broadcasting; Chaotic communication; Costs; Delay; Design engineering; Kernel; Packaging; Software packages; Workstations;
Conference_Titel :
Reliable Distributed Systems, 1994. Proceedings., 13th Symposium on
Conference_Location :
Dana Point, CA
Print_ISBN :
0-8186-6575-0
DOI :
10.1109/RELDIS.1994.336895