DocumentCode
964489
Title
Consolidated actor-critic model for partially-observable Markov decision processes
Author
Elhanany, I. ; Niedzwiedz, C. ; Liu, Zhe ; Livingston, S.
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN
Volume
44
Issue
22
fYear
2008
Firstpage
1317
Lastpage
1318
Abstract
A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.
Keywords
Markov processes; decision theory; Markov decision processes; actor-critic model; critic neural networks; temporal difference learning; traditionally separate actor;
fLanguage
English
Journal_Title
Electronics Letters
Publisher
iet
ISSN
0013-5194
Type
jour
DOI
10.1049/el:20081346
Filename
4658763
Link To Document