• DocumentCode
    1609364
  • Title

    A State Space Filter for Reinforcement Learning in POMIDPs - Application to a Continuous State Space

  • Author

    Nagayoshi, Masato ; Murao, Hajimne ; Tamaki, Hisashi

  • Author_Institution
    Grad. Sch. of Sci. & Tech., Kobe Univ.
  • fYear
    2006
  • Firstpage
    6037
  • Lastpage
    6042
  • Abstract
    This paper presents a technique to deal with both discrete and continuous state space systems in POMDPs for reinforcement learning while keeping the state space of an agent compact. First, our computational model for MDP environments, where a concept of "state space filtering" has been introduced and constructed to make properly the state space of an agent smaller by referring to "entropy" calculated based on the state-action mapping, is extended to be applicable in POMDP environments by introducing the mechanism of utilizing effectively of history information. Then, it is possible to deal with a continuous state space as well as a discrete state space. In this, the mechanism of adjusting the amount of history information is also introduced so that the state space of an agent should be compact. Moreover, some computational experiments with a robot navigation problem with a continuous state space have been carried out. The potential and the effectiveness of the extended approach have been confirmed through these experiments
  • Keywords
    Markov processes; decision theory; filtering theory; learning (artificial intelligence); observability; state-space methods; continuous state space filter; partially observable Markov decision process; reinforcement learning; robot navigation problem; Computational modeling; Control systems; Entropy; Filtering; Filters; History; Learning; Motion planning; Orbital robotics; State-space methods; POMIDPs; entropy; reinforcement leamig; state space design; state space filtering continuous state space;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE-ICASE, 2006. International Joint Conference
  • Conference_Location
    Busan
  • Print_ISBN
    89-950038-4-7
  • Electronic_ISBN
    89-950038-5-5
  • Type

    conf

  • DOI
    10.1109/SICE.2006.315203
  • Filename
    4108660