• DocumentCode
    964489
  • Title

    Consolidated actor-critic model for partially-observable Markov decision processes

  • Author

    Elhanany, I. ; Niedzwiedz, C. ; Liu, Zhe ; Livingston, S.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN
  • Volume
    44
  • Issue
    22
  • fYear
    2008
  • Firstpage
    1317
  • Lastpage
    1318
  • Abstract
    A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.
  • Keywords
    Markov processes; decision theory; Markov decision processes; actor-critic model; critic neural networks; temporal difference learning; traditionally separate actor
  • fLanguage
    English
  • Journal_Title
    Electronics Letters
  • Publisher
    IET
  • ISSN
    0013-5194
  • Type

    jour

  • DOI
    10.1049/el:20081346
  • Filename
    4658763