Author :
Hoffmann, Guenther A. ; Trivedi, Kishor S. ; Malek, Miroslaw
Abstract :
Recently, measurement-based studies of software systems have proliferated, reflecting an increasingly empirical focus on system availability, reliability, aging, and fault tolerance. However, it is a nontrivial, error-prone, arduous, and time-consuming task even for experienced system administrators, and statistical analysts to know what a reasonable set of steps should include to model, and successfully predict performance variables, or system failures of a complex software system. Reported results are fragmented, and focus on applying statistical regression techniques to monitored numerical system data. In this paper, we propose a best practice guide for building empirical models based on our experience with forecasting Apache web server performance variables, and forecasting call availability of a real-world telecommunication system. To substantiate the presented guide, and to demonstrate our approach in a step by step manner, we model, and predict the response time, and the amount of free physical memory of an Apache web server system, as well as the call availability of an industrial telecommunication system. Additionally, we present concrete results for a) variable selection where we cross benchmark three procedures, b) empirical model building where we cross benchmark four techniques, and c) sensitivity analysis. This best practice guide intends to assist in configuring modeling approaches systematically for best estimation, and prediction results.
Keywords :
Internet; fault tolerant computing; file servers; large-scale systems; regression analysis; software reliability; Apache Web server performance; complex software system; computing systems; empirical models; fault tolerance; industrial telecommunication system; measurement-based studies; real-world telecommunication system; resource forecasting; software systems; statistical analysts; statistical regression techniques; system administrators; system availability; system reliability; Aging; Availability; Best practices; Failure analysis; Fault tolerant systems; Performance analysis; Predictive models; Software measurement; Software systems; Web server; Apache web server; failure forecasting; monitoring; non-parametric modeling; prediction of resource utilization; quantitative analysis; statistical modeling; telecommunication systems;