DocumentCode :
3240592
Title :
Efficient, Unified, and Scalable Performance Monitoring for Multiprocessor Operating Systems
Author :
Wisniewski, Robert W. ; Rosenburg, Bryan
Author_Institution :
IBM T. J. Watson Research Center
fYear :
2003
fDate :
15-21 Nov. 2003
Firstpage :
3
Lastpage :
3
Abstract :
Programming, understanding, and tuning the performance of large multiprocessor systems is challenging. Experts have difficulty achieving good utilization for applications on large machines. The task of implementing a scalable system such as an operating system or database on large machines is even more challenging. And the importance of achieving good performance on multiprocessor machines is increasing as the number of cores per chip increases and as the size of multiprocessors increases. Crucial to achieving good performance is being able to understand the behavior of the system. We have developed an efficient, unified, and scalable tracing infrastructure that allows for correctness debugging, performance debugging, and performance monitoring of an operating system. The infrastructure allows variable-length events to be logged without locking and provides random access to the event stream. The infrastructure allows cheap and parallel logging of events by applications, libraries, servers, and the kernel. The infrastructure was designed for K42, a new open-source research kernel designed to scale near perfectly on large cache-coherent 64-bit multiprocessor systems. The techniques are generally applicable, and many of them have been integrated into the Linux Trace Toolkit. In this paper, we describe the implementation of the infrastructure, how we used the facility, e.g., analyzing lock contention, to understand and achieve K42´s scalable performance, and the lessons we learned. The infrastructure has been invaluable to achieving great scalability.
Keywords :
Databases; Debugging; Kernel; Libraries; Linux; Monitoring; Multiprocessing systems; Open source software; Operating systems; Performance analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing, 2003 ACM/IEEE Conference
Print_ISBN :
1-58113-695-1
Type :
conf
DOI :
10.1109/SC.2003.10049
Filename :
1592906
Link To Document :
بازگشت