Title :
An approach to selecting metrics for detecting performance problems in information systems
Author :
Hellerstein, Joseph L.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Early detection of performance problems is essential to limit their scope and impact. Most commonly, performance problems are detected by applying threshold tests to a set of detection metrics. For example, suppose that disk utilization is a detection metric, and its threshold value is 80%. Then, an alarm is raised if disk utilization exceeds 80%. Unfortunately, the ad hoc manner in which detection metrics are selected often results in false alarms and/or failing to detect problems until serious performance degradations result. To address this situation, we construct rules for metric selection based on analytic comparisons of statistical power equations for five widely used metrics: departure counts (D), number in system (L), response times (R), service times (S), and utilizations (U). These rules are assessed in the context of performance problems in the CPU and paging sub-systems of a production computer system
Keywords :
information systems; performance evaluation; software metrics; software performance evaluation; departure counts; detection metrics; disk utilization; information systems; metrics; number in system; paging sub-systems; performance degradations; performance problems; production computer system; response times; service times; statistical power equations; threshold tests; utilizations; Computerized monitoring; Degradation; Delay; Equations; Information systems; Intelligent networks; Production systems; Productivity; Quality of service; Testing;
Conference_Titel :
Systems Management, 1996., Proceedings of Second IEEE International Workshop on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-8186-7442-3
DOI :
10.1109/IWSM.1996.534144