DocumentCode :
2846486
Title :
Overcoming Scalability Challenges for Tool Daemon Launching
Author :
Ahn, Dong H. ; Arnold, Dorian C. ; de Supinski, Bronis R. ; Lee, Gregory L. ; Miller, Barton P. ; Schulz, Martin
Author_Institution :
Lawrence Livermore National Laboratory, Livermore, CA
fYear :
2008
fDate :
9-12 Sept. 2008
Firstpage :
578
Lastpage :
585
Abstract :
Many tools that target parallel and distributed environments must co-locate a set of daemons with the distributed processes of the target application. However, efficient and portable deployment of these daemons on large-scale systems is an unsolved problem. We overcome this gap with LaunchMON, a scalable, robust, portable, secure, and general-purpose infrastructure for launching tool daemons. Its API allows tool builders to identify all processes of a target job, launch daemons on the relevant nodes, and control daemon interaction. Our results show that LaunchMON scales to very large daemon counts and substantially enhances performance over existing ad hoc mechanisms.
Keywords :
application program interfaces; large-scale systems; parallel processing; software tools; API; ad hoc mechanisms; daemon interaction control; daemons portable deployment; distributed environments; distributed processes; large scale systems; parallel environments; tool daemon launching; Abstracts; Job design; Large-scale systems; Parallel processing; Resource management; Robustness; Runtime; Scalability; Standards development; Supercomputers;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel Processing, 2008. ICPP '08. 37th International Conference on
Conference_Location :
Portland, OR
ISSN :
0190-3918
Print_ISBN :
978-0-7695-3374-2
Electronic_ISBN :
0190-3918
Type :
conf
DOI :
10.1109/ICPP.2008.63
Filename :
4625896