Title :
Online System Problem Detection by Mining Patterns of Console Logs
Author :
Xu, Wei ; Huang, Ling ; Fox, Armando ; Patterson, David ; Jordan, Michael
Author_Institution :
EECS Dept., UC Berkeley, Berkeley, CA, USA
Abstract :
We describe a novel application of using data mining and statistical learning methods to automatically monitor and detect abnormal execution traces from console logs in an online setting. Different from existing solutions, we use a two stage detection system. The first stage uses frequent pattern mining and distribution estimation techniques to capture the dominant patterns (both frequent sequences and time duration). The second stage use principal component analysis based anomaly detection technique to identify actual problems. Using real system data from a 203-node Hadoop cluster, we show that we can not only achieve highly accurate and fast problem detection, but also help operators better understand execution patterns in their system.
Keywords :
data mining; learning (artificial intelligence); statistical analysis; 203-node Hadoop cluster; anomaly detection; console logs; data mining patterns; distribution estimation; execution patterns; frequent pattern mining; online setting; online system problem detection; principal component analysis; real system data; stage detection system; statistical learning method; Computerized monitoring; Condition monitoring; Data mining; Machine learning; Principal component analysis; Software maintenance; Statistical learning; USA Councils; Web and internet services; Web server; console logs; logs; monitoring; pattern mining; problem detection; system management;
Conference_Titel :
Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5242-2
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2009.19