Title :
Measurement-based availability analysis of Unix systems in a distributed environment
Author :
Simache, Cristina ; Kaâniche, Mohamed
Author_Institution :
Lab. d'Autom. et d'Anal. des Syst., CNRS, Toulouse, France
Abstract :
This paper presents a measurement-based availability study of networked Unix systems, based on data collected over 11 months from 298 workstations and servers interconnected through a local area computing network. The data corresponds to event logs recorded by the Unix operating system via the syslogd daemon. Our study focuses on the identification of machine reboots and the evaluation of statistical measures characterizing: (a) the distribution of reboots (per machine, time), (b) the distribution of uptimes and downtimes associated with these reboots, (c) the availability of machines including workstations and servers, and (d) error dependencies between clients and servers.
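As a rough illustration of measures (b) and (c), the sketch below derives uptimes, downtimes, and an availability figure for one machine from a list of outage intervals. It is not the authors' method: the function name `summarize_outages`, the input layout, and the definition of availability as total uptime divided by the length of the observation window are all assumptions, and the timestamps are invented for the example.

```python
from datetime import datetime, timedelta

def summarize_outages(outages, window_start, window_end):
    """Hypothetical helper: split an observation window into uptimes and downtimes.

    outages: list of (went_down, came_back_up) datetime pairs, assumed to be
    non-overlapping and fully inside [window_start, window_end].
    """
    outages = sorted(outages)
    # Each downtime is the gap between going down and coming back up.
    downtimes = [up - down for down, up in outages]
    # Each uptime runs from the previous restart (or window start) to the next outage.
    uptimes = []
    cursor = window_start
    for down, up in outages:
        uptimes.append(down - cursor)
        cursor = up
    uptimes.append(window_end - cursor)
    # Assumed definition: availability = total uptime / observation window.
    availability = sum(uptimes, timedelta()) / (window_end - window_start)
    return uptimes, downtimes, availability

# Invented example: a 100-hour window with a 1-hour and a 4-hour outage.
start = datetime(2000, 1, 1)
end = start + timedelta(hours=100)
outages = [
    (start + timedelta(hours=10), start + timedelta(hours=11)),
    (start + timedelta(hours=50), start + timedelta(hours=54)),
]
uptimes, downtimes, availability = summarize_outages(outages, start, end)
```

In a study like the one described, the outage intervals would first have to be reconstructed from reboot events in the logs; that step is left out here because the abstract does not say how it is done.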
Keywords :
Unix; computer network management; local area networks; reliability; availability of machines; availability study; clients; downtimes; error dependencies; local area network; machine reboots; networked Unix systems; servers; statistical measures; uptimes; workstations; Area measurement; Availability; Computer networks; Distributed computing; Error analysis; Failure analysis; Network servers; Operating systems; Time measurement; Workstations;
Conference_Title :
Software Reliability Engineering, 2001. ISSRE 2001. Proceedings. 12th International Symposium on
Print_ISBN :
0-7695-1306-9
DOI :
10.1109/ISSRE.2001.989489