• DocumentCode
    169121
  • Title

    An Anomaly Detection Approach for Scale-Out Storage Systems

  • Author

    Silvestre, Guthemberg ; Sauvanaud, Carla ; Kaaniche, M. ; Kanoun, Karama

  • Author_Institution
    LAAS, Toulouse, France
  • fYear
    2014
  • fDate
    22-24 Oct. 2014
  • Firstpage
    294
  • Lastpage
    301
  • Abstract
    Scale-out storage systems (SoSS) have become increasingly important for meeting availability requirements of web services in cloud platforms. To enhance data availability, SoSS rely on a variety of built-in fault-tolerant mechanisms, including replication, redundant network topologies, advanced request scheduling, and other failover techniques. However, performance issues in cloud services still remain one of the main causes of discontentment among their tenants. In this paper, we propose an anomaly detection approach for SoSS that predicts cloud anomalies caused by memory and network faults. To evaluate our prediction model, we built a testbed simulating a virtual data center using VMware. Experimental results confirm that the injected faults are likely to undermine the data availability in SoSS. They suggest that although unsupervised learning has been the most common method for anomaly detection, a supervisedbased implementation of the same model reduces the false positive rate by roughly 10%. Our analysis also points out that probing SoSS-specific monitoring data at the VM-level contributes to improve the anomaly prediction efficiency.
  • Keywords
    Web services; cloud computing; fault tolerant computing; security of data; storage management; unsupervised learning; virtualisation; SoSS; VMware; Web services; anomaly detection approach; anomaly prediction efficiency; availability requirements; built-in fault-tolerant mechanisms; cloud platforms; cloud services; data availability; failover techniques; faults injection; memory faults; network faults; prediction model; redundant network topologies; replication; request scheduling; scale-out storage systems; unsupervised learning; virtual data center; Availability; Loss measurement; Monitoring; Predictive models; Throughput; Training; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture and High Performance Computing (SBAC-PAD), 2014 IEEE 26th International Symposium on
  • Conference_Location
    Jussieu
  • ISSN
    1550-6533
  • Type

    conf

  • DOI
    10.1109/SBAC-PAD.2014.42
  • Filename
    6970677