DocumentCode
2827981
Title
Application of Reinforcement Learning to Software Rejuvenation
Author
Okamura, Hiroyuki ; Dohi, Tadashi
Author_Institution
Dept. of Inf. Eng., Hiroshima Univ., Higashi-Hiroshima, Japan
fYear
2011
fDate
23-27 March 2011
Firstpage
647
Lastpage
652
Abstract
Software rejuvenation is a preventive and proactive maintenance solution that is particularly useful for counteracting the phenomenon of software aging. Hence, it should be ideally triggered adaptively without the complete knowledge on system failure (degradation) time distribution in operational phase. In this paper we consider an operational software system with multiple degradation levels and derive the optimal software rejuvenation policy maximizing the steady-state system availability, via the semi-Markov decision process. We develop a statistically non-parametric algorithm to estimate the optimal software rejuvenation schedule. Then, the reinforcement learning algorithm, called Q learning, is used for developing an on-line adaptive algorithm. A numerical example is presented to investigate asymptotic behavior of the resulting on-line adaptive algorithm.
Keywords
Markov processes; decision theory; learning (artificial intelligence); nonparametric statistics; preventive maintenance; software maintenance; software reliability; system recovery; Q learning; asymptotic behavior; on-line adaptive algorithm; operational software system; optimal software rejuvenation schedule; preventive maintenance; proactive maintenance; reinforcement learning; semiMarkov decision process; software aging; software rejuvenation; statistically nonparametric algorithm; steady-state system availability; Availability; Schedules; Software algorithms; Software systems; Steady-state; Transient analysis; Q-learning; adaptive algorithm; software rejuvenation;
fLanguage
English
Publisher
ieee
Conference_Titel
Autonomous Decentralized Systems (ISADS), 2011 10th International Symposium on
Conference_Location
Tokyo & Hiroshima
Print_ISBN
978-1-61284-213-4
Type
conf
DOI
10.1109/ISADS.2011.92
Filename
5741421
Link To Document