Title :
Discovery of root causes of system failures by means of analysis of repair records
Author :
Lu, Tsai-Ching ; Przytula, K. Wojtek
Author_Institution :
HRL Labs., LLC, Malibu, CA, USA
Abstract :
Repair records constitute an invaluable source of information for early detection of systematic failures, despite issues such as inherent noise and missing data. In this paper, we present a methodology and algorithms for mining repair records to discover root causes of system failures. We employ both domain-driven and data-driven clustering approaches to reduce data noise and to consider system failures at different levels of granularity. We use probabilistic graphical models, learned from the data, to identify potential causal relations among clusters of repair records. Our methodology and algorithms are implemented in a comprehensive software environment, which assists analysts in interactively discovering root causes of failures from repair records.
Keywords :
data handling; engineering computing; failure analysis; learning (artificial intelligence); maintenance engineering; pattern clustering; probability; records management; reliability theory; data noise; data-driven clustering; domain-driven clustering; information source; learning; probabilistic graphical models; repair records; root causes; software environment; system failures; Clustering algorithms; Failure analysis; Graphical models; Information resources; Noise level; Noise reduction; Performance analysis; Software algorithms; Software performance; Working environment noise;
Conference_Title :
Aerospace Conference, 2010 IEEE
Conference_Location :
Big Sky, MT
Print_ISBN :
978-1-4244-3887-7
ISSN :
1095-323X
DOI :
10.1109/AERO.2010.5446824