DocumentCode
3385182
Title
NetLogger: a toolkit for distributed system performance tuning and debugging
Author
Gunter, Dan ; Tierney, Brian
Author_Institution
Lawrence Berkeley Nat. Lab., CA, USA
fYear
2003
fDate
24-28 March 2003
Firstpage
97
Lastpage
100
Abstract
Developers and users of high-performance distributed systems often observe performance problems such as unexpectedly low throughput or high latency. Determining the source of the performance problems requires detailed end-to-end instrumentation of all components, including the applications, operating systems, hosts, and networks. In this paper we describe a methodology that enables the real-time diagnosis of performance problems in complex high-performance distributed systems. The methodology includes tools for generating timestamped event logs that can be used to provide detailed end-to-end application and system level monitoring; and tools for visualizing the log data and real-time state of the distributed system. This methodology, called NetLogger, has proven invaluable for diagnosing problems in networks and in distributed systems code. This approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system. NetLogger is designed to be extremely lightweight, and includes a mechanism for reliably collecting monitoring events from multiple distributed locations.
Keywords
computer network management; computerised monitoring; data visualisation; performance evaluation; NetLogger toolkit; application-level monitoring; debugging; distributed system performance tuning; high-performance distributed systems; host-level monitoring; latency; log data visualization; network-level monitoring; real-time diagnosis; system level monitoring; throughput; timestamped event logs; Condition monitoring; Data visualization; Debugging; Delay; Instruments; Libraries; Operating systems; Real time systems; System performance; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Integrated Network Management, 2003. IFIP/IEEE Eighth International Symposium on
Print_ISBN
1-4020-7418-2
Type
conf
DOI
10.1109/INM.2003.1194164
Filename
1194164
Link To Document