• DocumentCode
    3385182
  • Title

    NetLogger: a toolkit for distributed system performance tuning and debugging

  • Author

    Gunter, Dan ; Tierney, Brian

  • Author_Institution
    Lawrence Berkeley Nat. Lab., CA, USA
  • fYear
    2003
  • fDate
    24-28 March 2003
  • Firstpage
    97
  • Lastpage
    100
  • Abstract
    Developers and users of high-performance distributed systems often observe performance problems such as unexpectedly low throughput or high latency. Determining the source of the performance problems requires detailed end-to-end instrumentation of all components, including the applications, operating systems, hosts, and networks. In this paper we describe a methodology that enables the real-time diagnosis of performance problems in complex high-performance distributed systems. The methodology includes tools for generating timestamped event logs that can be used to provide detailed end-to-end application and system level monitoring; and tools for visualizing the log data and real-time state of the distributed system. This methodology, called NetLogger, has proven invaluable for diagnosing problems in networks and in distributed systems code. This approach is novel in that it combines network, host, and application-level monitoring, providing a complete view of the entire system. NetLogger is designed to be extremely lightweight, and includes a mechanism for reliably collecting monitoring events from multiple distributed locations.
  • Keywords
    computer network management; computerised monitoring; data visualisation; performance evaluation; NetLogger toolkit; application-level monitoring; debugging; distributed system performance tuning; high-performance distributed systems; host-level monitoring; latency; log data visualization; network-level monitoring; real-time diagnosis; system level monitoring; throughput; timestamped event logs; Condition monitoring; Data visualization; Debugging; Delay; Instruments; Libraries; Operating systems; Real time systems; System performance; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Integrated Network Management, 2003. IFIP/IEEE Eighth International Symposium on
  • Print_ISBN
    1-4020-7418-2
  • Type

    conf

  • DOI
    10.1109/INM.2003.1194164
  • Filename
    1194164