• DocumentCode
    2283920
  • Title

    Using machine learning for non-intrusive modeling and prediction of software aging

  • Author

    Andrzejak, Artur ; Silva, Luis

  • Author_Institution
    Zuse Inst. Berlin, Berlin
  • fYear
    2008
  • fDate
    7-11 April 2008
  • Firstpage
    25
  • Lastpage
    32
  • Abstract
    The wide-spread phenomenon of software (running image) aging is known to cause performance degradation, transient failures or even crashes of applications. In this work we describe first a method for monitoring and modeling of performance degradation in SOA applications, particularly application servers. This method works for a large class of the aging processes caused by resource depletion (e.g. memory leaks). It can be deployed non-intrusively in a production environment, under arbitrary service request distributions. Based on this schema we investigate in the second part of the paper how machine learning (classification) algorithms can be used for proactive detection of performance degradation or sudden drops caused by aging. We leverage the predictive power of these algorithms with several techniques to make the measurement-based aging models more adaptive and more robust against transient failures. We evaluate several state-of-the-art classification methods for their accuracy and computational efficiency in this scenario. The studies are performed on a data set generated by a TPC-W benchmark instrumented with a memory leak injector. The results show that the probing method yields accurate aging models with low overhead and the machine learning approach gives statistically significant short-term predictions of degrading application performance. Both approaches can be used directly to fight aging via adaptive software rejuvenation (restart of the application), for operator alerting, or for short-term capacity planning.
  • Keywords
    learning (artificial intelligence); resource allocation; software architecture; software performance evaluation; storage management; system monitoring; SOA applications; adaptive software rejuvenation; application crash; application servers; capacity planning; machine learning; memory leak injector; memory leaks; nonintrusive modeling; performance degradation; probing method; resource depletion; service request distribution; software aging prediction; system monitoring; transient failure; Aging; Application software; Computer crashes; Condition monitoring; Degradation; Machine learning; Machine learning algorithms; Predictive models; Service oriented architecture; Software performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Network Operations and Management Symposium, 2008. NOMS 2008. IEEE
  • Conference_Location
    Salvador, Bahia
  • ISSN
    1542-1201
  • Print_ISBN
    978-1-4244-2065-0
  • Electronic_ISBN
    1542-1201
  • Type

    conf

  • DOI
    10.1109/NOMS.2008.4575113
  • Filename
    4575113