DocumentCode
614633
Title
Adiabatic Markov Decision Process with application to queuing systems
Author
Thai Duong ; Duong Nguyen-Huu ; Thinh Nguyen
Author_Institution
Sch. of Electr. Eng. & Comput. Sci., Oregon State Univ., Corvallis, OR, USA
fYear
2013
fDate
20-22 March 2013
Firstpage
1
Lastpage
6
Abstract
Markov Decision Process (MDP) is a well-known framework for devising the optimal decision making strategies under uncertainty. Typically, the decision maker assumes a stationary environment which is characterized by a time-invariant transition probability matrix. However, in many realworld scenarios, this assumption is not justified, thus the optimal strategy might not provide the expected performance. In this paper, we study the performance of the classic Value Iteration (VI) algorithm for solving an MDP problem under non-stationary environments. Specifically, the non-stationary environment is modeled as a sequence of time-variant transition probability matrices governed by an adiabatic evolution inspired from quantum mechanics. We characterize the performance of the VI algorithm subject to the rate of change of the underlying environment. The performance is measured in terms of the convergence rate to the optimal average reward. We show two examples of queuing systems that make use of our analysis framework.
Keywords
Markov processes; decision making; iterative methods; matrix algebra; probability; queueing theory; MDP problem; adiabatic Markov decision process; classic VI algorithm; classic value iteration algorithm; nonstationary environments; optimal decision making strategies; optimal strategy; queuing systems; stationary environment; time-variant transition probability matrices; Algorithm design and analysis; Convergence; Estimation; Heuristic algorithms; Markov processes; Upper bound; Vectors; Adiabatic; Markov Decision Process; Value Iteration;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences and Systems (CISS), 2013 47th Annual Conference on
Conference_Location
Baltimore, MD
Print_ISBN
978-1-4673-5237-6
Electronic_ISBN
978-1-4673-5238-3
Type
conf
DOI
10.1109/CISS.2013.6552321
Filename
6552321
Link To Document