DocumentCode :
259156
Title :
Associative Criteria in Mutually Dependent Markov Decision Processes
Author :
Fujita, Takashi
Author_Institution :
Grad. Sch. of Eng., Kyushu Inst. of Technol., Kitakyushu, Japan
fYear :
2014
fDate :
Aug. 31 2014-Sept. 4 2014
Firstpage :
147
Lastpage :
150
Abstract :
In this paper, we consider associative criteria in mutually dependent Markov decision processes (MDMDP). The MDMDP model is structured upon two types of finite-stage Markov decision process: main-process and sub-process. At each stage, the reward in one process is given by the optimal value of the alternative process problem, whose initial state is determined by the current state and decision in the original process. We introduce an associative criterion to each MDMDP and derive mutually dependent recursive equations by dynamic programming with an invariant imbedding technique.
Keywords :
Markov processes; decision making; dynamic programming; MDMDP model; associative criteria; dynamic programming; finite-stage Markov decision process; invariant imbedding technique; mutually dependent Markov decision process; mutually dependent recursive equations; Decision making; Dynamic programming; Equations; Informatics; Markov processes; Mathematical model; Presses; Associative Reward System; Dynamic Programming; Markov Decision Process;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Applied Informatics (IIAIAAI), 2014 IIAI 3rd International Conference on
Conference_Location :
Kitakyushu
Print_ISBN :
978-1-4799-4174-2
Type :
conf
DOI :
10.1109/IIAI-AAI.2014.39
Filename :
6913283
Link To Document :
بازگشت