DocumentCode :
2516115
Title :
Non-stop monitoring and debugging on shared-memory multiprocessors
Author :
Stewart, Darlene A. ; Gentleman, W. Morven
Author_Institution :
Nat. Res. Council of Canada, Ottawa, Ont., Canada
fYear :
1997
fDate :
17-18 May 1997
Firstpage :
263
Lastpage :
269
Abstract :
Monitoring and debugging parallel programs is a difficult activity. There are many situations where the traditional “stop the world, I want to get off” approach to debugging is simply unsuitable. Frequently, nonintrusive monitoring of the program execution is more productive in locating sources of error and also in monitoring “correct” programs for such purposes as performance measurement and tuning. This paper presents a number of space- and time-efficient tools and techniques to support nonintrusive, non-stop monitoring and debugging of parallel programs running on a shared-memory multiprocessor. The techniques include the use of spy tasks, circular history buffers, vectors of use bits, and data structure audits. Particular emphasis is placed on issues that pertain to parallel computing, such as dealing with concurrent execution, shared memory, and data caches.
Keywords :
online operation; parallel programming; program debugging; real-time systems; shared memory systems; system monitoring; circular history buffers; concurrent execution; data caches; data structure audits; debugging; nonintrusive monitoring; nonstop monitoring; parallel programs; performance measurement; shared-memory multiprocessors; space-efficient tools; spy tasks; time-efficient tools; vectors of use bits; Councils; Data structures; Debugging; Error analysis; Error correction; History; Measurement; Monitoring; Parallel processing; Probes
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Software Engineering for Parallel and Distributed Systems, 1997. Proceedings., Second International Workshop on
Conference_Location :
Boston, MA
Print_ISBN :
0-8186-8043-1
Type :
conf
DOI :
10.1109/PDSE.1997.596845
Filename :
596845