• DocumentCode
    1605619
  • Title

    Adaptive Profiling for Root-Cause Analysis of Performance Anomalies in Web-Based Applications

  • Author

    Magalhães, João Paulo ; Silva, Luis Moura

  • Author_Institution
    CIICESI, ESTGF-Porto Polytech. Inst., Felgueiras, Portugal
  • fYear
    2011
  • Firstpage
    171
  • Lastpage
    178
  • Abstract
    The most important factor in the assessment of the availability of a system is the mean-time to repair (MTTR). The lower the MTTR the higher the availability. A significant portion of the MTTR is spent in the detection and localization of the cause of the failure. One possible method that may provide good results in the root-cause analysis of application failures is run-time profiling. The major drawback of run-time profiling is the performance impact. In this paper we describe two algorithms for selective and adaptive profiling of web-based applications. The algorithms make use of a dynamic profiling interval and are mainly triggered when some of the transactions start presenting some symptoms of performance anomaly. The algorithms were tested under different types of degradation scenarios and compared to static sampling strategies. We observed through experimentation that the pinpoint of performance anomalies, supported by the data collected using the adaptive profiling algorithms, stills timely as with full-profiling while the response time overhead is reduced in almost 60%. When compared to a non-profiled version the response time overhead is less than 1.5%. These results show the viability of using run-time profiling to support quickly detection and pinpointing of performance anomalies and enable timely recovery.
  • Keywords
    Internet; system recovery; MTTR; Web-based application; adaptive profiling algorithm; application failures; dynamic profiling interval; mean time to repair; performance anomaly pinpointing; response time overhead reduction; root-cause analysis; run-time profiling; selective profiling; system recovery; Algorithm design and analysis; Correlation; Degradation; Heuristic algorithms; Monitoring; Servers; Time factors; application profiling; dependability; monitoring; performance anomalies; root-cause analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Computing and Applications (NCA), 2011 10th IEEE International Symposium on
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-1-4577-1052-0
  • Electronic_ISBN
    978-0-7695-4489-2
  • Type

    conf

  • DOI
    10.1109/NCA.2011.30
  • Filename
    6038599