Title :
Automatic Recovery Using Bounded Partially Observable Markov Decision Processes
Author :
Joshi, Kaustubh R. ; Sanders, William H. ; Hiltunen, Matti A. ; Schlichting, Richard D.
Author_Institution :
Coordinated Sci. Lab., Illinois Univ., Urbana, IL
Abstract :
This paper provides a technique, based on partially observable Markov decision processes (POMDPs), for building automatic recovery controllers to guide distributed system recovery in a way that provides provable assurances on the quality of the generated recovery actions even when the diagnostic information may be imprecise. Lower bounds on the cost of recovery are introduced and proved, and it is shown how the characteristics of the recovery process can be used to ensure that the lower bounds converge even on undiscounted models. The bounds used in an appropriate online controller provide it with provable termination properties. Simulation-based experimental results on a realistic e-commerce system demonstrate that the proposed bounds can be improved iteratively, and the resulting controller convincingly outperforms a controller that uses heuristics instead of bounds
Keywords :
Markov processes; computational complexity; decision theory; distributed processing; program diagnostics; system recovery; POMDP technique; automatic recovery controllers; bounded partially observable Markov decision processes; distributed system recovery; lower bounds; Automatic control; Automatic generation control; Computerized monitoring; Condition monitoring; Costs; Distributed control; Fault detection; Fault diagnosis; System recovery; Writing;
Conference_Titel :
Dependable Systems and Networks, 2006. DSN 2006. International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7695-2607-1
DOI :
10.1109/DSN.2006.16