Consolidated actor-critic model for partially-observable Markov decision processes

Author

Elhanany, I. ; Niedzwiedz, C. ; Liu, Zhe ; Livingston, S.

Author_Institution

Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN

Volume

Issue

fYear

2008

Firstpage

1317

Lastpage

1318

Abstract

A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.

Keywords

Markov processes; decision theory; Markov decision processes; actor-critic model; critic neural networks; temporal difference learning; traditionally separate actor;

fLanguage

English

Journal_Title

Electronics Letters

Publisher

iet

ISSN

0013-5194

Type

jour

DOI

10.1049/el:20081346

Filename

4658763

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=964489