• DocumentCode
    1975048
  • Title

    A self-evolving anomaly detection framework for developing highly dependable utility clouds

  • Author

    Pannu, H.S. ; Jianguo Liu ; Song Fu

  • Author_Institution
    Dept. of Math., Univ. of North Texas, Denton, TX, USA
  • fYear
    2012
  • fDate
    3-7 Dec. 2012
  • Firstpage
    1605
  • Lastpage
    1610
  • Abstract
    Utility clouds continue to grow in scale and in the complexity of their components and interactions, which introduces a key challenge to failure and resource management for highly dependable cloud computing. Autonomic anomaly detection is a crucial technique for understanding emergent, cloud-wide phenomena and self-managing cloud resources for system-level dependability assurance. To identify anomalies, we need to monitor the system execution and collect health-related runtime performance data. These data are usually unlabeled and a prior failure history is not always available in production systems, especially for newly deployed or managed utility clouds. In this paper, we present a self-evolving anomaly detection framework with mechanisms for dependability assurance in utility clouds. No prior failure history is required. The detector self-evolves by recursively exploring newly generated verified detection results for future anomaly identification. Statistical learning technologies are exploited in detector determination and working dataset selection. Experimental results in an institute-wide cloud computing system show that the detection accuracy improves as it evolves. With self-evolvement, the detector can achieve 92.1% detection sensitivity and 83.8% detection specificity, which makes it well suitable for building highly dependable utility clouds.
  • Keywords
    cloud computing; data acquisition; learning (artificial intelligence); resource allocation; software performance evaluation; software reliability; statistical analysis; autonomic anomaly detection accuracy; cloud-wide phenomena; dependability assurance; detector determination; emergent cloud-wide phenomena; failure management; health-related runtime performance data collection; highly dependable utility cloud computing; institute-wide cloud computing system; resource management; self-evolving anomaly detection framework; self-managing cloud resource; statistical learning technologies; system-level dependability assurance; working dataset selection; Anomaly identification; Autonomic management; Cloud computing; Dependable systems; Self evolvement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Communications Conference (GLOBECOM), 2012 IEEE
  • Conference_Location
    Anaheim, CA
  • ISSN
    1930-529X
  • Print_ISBN
    978-1-4673-0920-2
  • Electronic_ISBN
    1930-529X
  • Type

    conf

  • DOI
    10.1109/GLOCOM.2012.6503343
  • Filename
    6503343