DocumentCode :
2719642
Title :
Scalable cluster administration - Chiba City I approach and lessons learned
Author :
Navarro, John-Paul ; Evard, Rémy ; Nurmi, Dan ; Desai, Narayan
Author_Institution :
Div. of Math. & Comput. Sci., Argonne Nat. Lab., IL, USA
fYear :
2002
fDate :
2002
Firstpage :
215
Lastpage :
221
Abstract :
Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.
Keywords :
computer network management; workstation clusters; Chiba City I; Linux cluster; cluster administration; network services; scalable cluster; Automatic control; Cities and towns; Computer science; Hardware; Laboratories; Linux; Mathematics; Operating systems; Performance analysis; Software design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster Computing, 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-2066-9
Type :
conf
DOI :
10.1109/CLUSTR.2002.1137749
Filename :
1137749
Link To Document :
بازگشت