Title :
Classification and Impact Analysis of Faults in Automated System Management
Author :
McLarnon, Barry ; Robinson, Philip ; Sage, Paul ; Milligan, Peter
Author_Institution :
SAP Res. CEC Belfast, Belfast, UK
Abstract :
The reliability of automated system management solutions will increase in importance as the use of cloud computing and data centres expands. As part of a study to improve reliability, this paper provides a classification of faults that can occur in automated system management and proposes a method for determining the severity of such faults. A baseline deployment is compared with an alternate proposed configuration to determine the difference in reliability. The results gained show a significant improvement over the baseline. While it is still in development, the method is able to determine and compare the reliability of deployment configurations from early in the design process.
Keywords :
Internet; computer centres; computer network reliability; fault tolerant computing; automated system management; baseline deployment; cloud computing; data centre; faults classification; faults impact analysis; Analytical models; Computer architecture; Humans; Reliability engineering; Software; Software reliability; adaptability; automation; dependability; management; reliability;
Conference_Titel :
Dependability (DEPEND), 2010 Third International Conference on
Conference_Location :
Venice
Print_ISBN :
978-1-4244-7530-8
DOI :
10.1109/DEPEND.2010.34