DocumentCode
2176869
Title
Analysis of failure and recovery rates in a wireless telecommunications system
Author
Matz, Steven M. ; Votta, Lawrence G. ; Malkawi, Mohammad
Author_Institution
Motorola Inc., Arlington Heights, IL, USA
fYear
2002
fDate
2002
Firstpage
687
Lastpage
693
Abstract
We derive estimates of mean time to failure and mean time to recover/repair for both hardware and software in a large wireless telecommunications system, based on six months of manually recorded outage data. The observed failure and recovery distributions are not consistent with simple exponential processes. The data can be described by Weibull or two-stage hyper-exponential distributed processes. The duration distributions for scheduled and unscheduled software outages have very different characteristics. The complex distributions observed may be the composition of simple independent processes which cannot be separated in this data set due to a lack of adequately detailed information or proper characterization of outage causes. In this system we found a coverage of ∼98% for autorecovery from unscheduled software failures with an autorepair fraction of ∼36%.
Keywords
Weibull distribution; computer communications software; system recovery; wireless LAN; duration distributions; failure and recovery; mean time to failure; mean time to recover; software failures; software outages; wireless telecommunications system; Availability; Base stations; Control systems; Failure analysis; Hardware; Humans; Parameter estimation; Particle measurements; Predictive models; Shape;
fLanguage
English
Publisher
ieee
Conference_Titel
Dependable Systems and Networks, 2002. DSN 2002. Proceedings. International Conference on
Print_ISBN
0-7695-1101-5
Type
conf
DOI
10.1109/DSN.2002.1029014
Filename
1029014
Link To Document