DocumentCode
2580290
Title
Autonomous configuration of grid monitoring systems
Author
Shirose, Ken´ichiro ; Matsuoka, Satoshi ; Nakada, Hidemoto ; Ogawa, Hirotaka
Author_Institution
Tokyo Inst. of Technol., Japan
fYear
2004
fDate
26-30 Jan. 2004
Firstpage
651
Lastpage
657
Abstract
The problem with practical, large-scale deployment of grid monitoring system is that it takes considerable management cost and skills to maintain the level of quality required by production usage since the monitoring system is fundamentally distributed, need to be running continuously, and in itself likely be affected by the various faults and dynamic reconfigurations of the grid itself. Although their automated management would be desirable, there are several difficulties, distributed faults and reconfigurations, component interdependencies, and scaling to maintain performance while minimizing probing effect. Given our goal to develop a generalized autonomous management framework for grid monitoring, we have built a prototype, on top of NWS, featuring automatic configuration of its "clique" groups as well as coping with single-node faults without user intervention. An experimental deployment on the Tokyo Institute of Technology\´s Campus Grid (the Titech Grid) consisting of over 15 sites and 800 processors has shown the system to be robust in handling faults and reconfigurations, automatically deriving an ideal clique configuration for the head login nodes of each PC cluster in less than two minutes.
Keywords
computer network management; computerised monitoring; grid computing; performance evaluation; NWS; PC cluster; Titech Grid; automated management; automatic configuration; clique configuration; component interdependencies; distributed faults; distributed reconfigurations; grid monitoring system; head login nodes; management cost; management skills; single-node faults; user intervention; Computerized monitoring; Condition monitoring; Continuous production; Costs; Grid computing; Informatics; Large-scale systems; Production systems; Prototypes; Quality management;
fLanguage
English
Publisher
ieee
Conference_Titel
Applications and the Internet Workshops, 2004. SAINT 2004 Workshops. 2004 International Symposium on
Print_ISBN
0-7695-2050-2
Type
conf
DOI
10.1109/SAINTW.2004.1268702
Filename
1268702
Link To Document