DocumentCode
259156
Title
Associative Criteria in Mutually Dependent Markov Decision Processes
Author
Fujita, Takashi
Author_Institution
Grad. Sch. of Eng., Kyushu Inst. of Technol., Kitakyushu, Japan
fYear
2014
fDate
Aug. 31 2014-Sept. 4 2014
Firstpage
147
Lastpage
150
Abstract
In this paper, we consider associative criteria in mutually dependent Markov decision processes (MDMDP). The MDMDP model is structured upon two types of finite-stage Markov decision process: main-process and sub-process. At each stage, the reward in one process is given by the optimal value of the alternative process problem, whose initial state is determined by the current state and decision in the original process. We introduce an associative criterion to each MDMDP and derive mutually dependent recursive equations by dynamic programming with an invariant imbedding technique.
Keywords
Markov processes; decision making; dynamic programming; MDMDP model; associative criteria; dynamic programming; finite-stage Markov decision process; invariant imbedding technique; mutually dependent Markov decision process; mutually dependent recursive equations; Decision making; Dynamic programming; Equations; Informatics; Markov processes; Mathematical model; Presses; Associative Reward System; Dynamic Programming; Markov Decision Process;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Applied Informatics (IIAIAAI), 2014 IIAI 3rd International Conference on
Conference_Location
Kitakyushu
Print_ISBN
978-1-4799-4174-2
Type
conf
DOI
10.1109/IIAI-AAI.2014.39
Filename
6913283
Link To Document