Title :
Application-dependent dynamic monitoring of distributed and parallel systems
Author :
Ogle, David M. ; Schwan, Karsten ; Snodgrass, Richard
Author_Institution :
Ohio State Univ., Columbus, OH, USA
fDate :
7/1/1993 12:00:00 AM
Abstract :
Achieving high performance for parallel or distributed programs often requires substantial amounts of information about the programs themselves, about the systems on which they are executing, and about specific program runs. The monitoring system that collects, analyzes, and makes application-dependent monitoring information available to the programmer and to the executing program is presented. The system may be used for off-line program analysis, for on-line debugging, and for making on-line, dynamic changes to parallel or distributed programs to enhance their performance. The authors use a high-level, uniform data model for the representation of program information and monitoring data. They show how this model may be used for the specification of program views and attributes for monitoring, and demonstrate how such specifications can be translated into efficient, program-specific monitoring code that uses alternative mechanisms for the distributed analysis and collection to be performed for the specified views. The model´s utility has been demonstrated on a wide variety of parallel machines
Keywords :
distributed processing; parallel programming; program debugging; program testing; application dependent dynamic monitoring; attributes; distributed analysis; distributed programs; distributed systems; off-line program analysis; on-line debugging; parallel machines; parallel systems; program-specific monitoring code; specification; uniform data model; Data models; Debugging; Information analysis; Local area networks; Monitoring; Operating systems; Parallel machines; Performance analysis; Programming profession; Writing;
Journal_Title :
Parallel and Distributed Systems, IEEE Transactions on