• DocumentCode
    2178535
  • Title

    Optimal software rejuvenation policies

  • Author

    Agepati, R. ; Gundala, N. ; Amari, S.V.

  • Author_Institution
    HSBC Software Dev. (I) Pvt. Ltd., Hyderabad, India
  • fYear
    2013
  • fDate
    28-31 Jan. 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Studies on software reliability and performance reveal that long-running software systems show an increasing failure rate and/or a progressive degradation of their performance. This phenomenon is referred to as software aging, and it may eventually lead to an unacceptable level of system performance degradation and/or crash failure. A technique called software rejuvenation can be used to counteract software aging. This technique involves occasionally terminating an application or a system, cleaning its internal state and/or its environment, and restarting it. By removing the accrued error conditions and freeing up or defragmenting operating system resources, this technique proactively prevents unexpected future system outages. Unlike downtime caused by sudden failure occurrences, the downtime related to software rejuvenation can be scheduled at the discretion of the user or administrator, typically during the middle of the night or over weekends. This paper presents a generalized condition-based software rejuvenation model that is applicable to a wide range of applications. The rejuvenation model includes a stochastic deterioration process, a set of rejuvenation actions and their effects, and a schedule inspection policy that identifies the system deterioration. The optimal rejuvenation policy that minimizes the overall cost associated with the system is obtained using Markov decision processes. With minor modifications, the model can also be used for maximizing the system availability/capacity. This paper demonstrates the proposed model and the optimization procedure using an example of a web server subject to a two-dimensional software degradation process.
  • Keywords
    Markov processes; software reliability; Markov decision process; Web server; condition-based software rejuvenation; crash failure; schedule inspection policy; software aging; software degradation process; software performance; software rejuvenation policy; software reliability; system performance degradation; Availability; Degradation; Inspection; Maintenance engineering; Markov processes; Servers; Software; optimization; redundancy optimization; resource constraints; software aging; software rejuvenation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reliability and Maintainability Symposium (RAMS), 2013 Proceedings - Annual
  • Conference_Location
    Orlando, FL
  • ISSN
    0149-144X
  • Print_ISBN
    978-1-4673-4709-9
  • Type

    conf

  • DOI
    10.1109/RAMS.2013.6517695
  • Filename
    6517695