Title :
A self-evolving anomaly detection framework for developing highly dependable utility clouds
Author :
Pannu, H.S. ; Jianguo Liu ; Song Fu
Author_Institution :
Dept. of Math., Univ. of North Texas, Denton, TX, USA
Abstract :
Utility clouds continue to grow in scale and in the complexity of their components and interactions, which introduces a key challenge to failure and resource management for highly dependable cloud computing. Autonomic anomaly detection is a crucial technique for understanding emergent, cloud-wide phenomena and self-managing cloud resources for system-level dependability assurance. To identify anomalies, we need to monitor the system execution and collect health-related runtime performance data. These data are usually unlabeled and a prior failure history is not always available in production systems, especially for newly deployed or managed utility clouds. In this paper, we present a self-evolving anomaly detection framework with mechanisms for dependability assurance in utility clouds. No prior failure history is required. The detector self-evolves by recursively exploring newly generated verified detection results for future anomaly identification. Statistical learning technologies are exploited in detector determination and working dataset selection. Experimental results in an institute-wide cloud computing system show that the detection accuracy improves as it evolves. With self-evolvement, the detector can achieve 92.1% detection sensitivity and 83.8% detection specificity, which makes it well suitable for building highly dependable utility clouds.
Keywords :
cloud computing; data acquisition; learning (artificial intelligence); resource allocation; software performance evaluation; software reliability; statistical analysis; autonomic anomaly detection accuracy; cloud-wide phenomena; dependability assurance; detector determination; emergent cloud-wide phenomena; failure management; health-related runtime performance data collection; highly dependable utility cloud computing; institute-wide cloud computing system; resource management; self-evolving anomaly detection framework; self-managing cloud resource; statistical learning technologies; system-level dependability assurance; working dataset selection; Anomaly identification; Autonomic management; Cloud computing; Dependable systems; Self evolvement;
Conference_Titel :
Global Communications Conference (GLOBECOM), 2012 IEEE
Conference_Location :
Anaheim, CA
Print_ISBN :
978-1-4673-0920-2
Electronic_ISBN :
1930-529X
DOI :
10.1109/GLOCOM.2012.6503343