• DocumentCode
    2644476
  • Title

    About distributing rewards to a rule with probabilistic state transition

  • Author

    Uemura, Wataru

  • Author_Institution
    Ryukoku Univ., Kyoto
  • fYear
    2007
  • fDate
    17-20 Sept. 2007
  • Firstpage
    2762
  • Lastpage
    2765
  • Abstract
    Profit Sharing is one of the reinforcement learning methods. On Profit Sharing, an agent as a learner distributes rewards to rules selected by the agent after reaching a goal state. If there is a non-deterministic state transition rule, for example a probabilistic one, an agent must consider the estimate value of its rule with the probabilistic state transition. Conventional Profit Sharing does not consider the probabilistic state transition because it distributes same rewards even if the state transition probability is 10%, 1%, and so on. In this paper, we propose the novel Profit Sharing method which considers the probabilistic state transition. In the environment with deterministic state transitions, we show the same performance both the conventional Profit Sharing and proposed Profit Sharing. And show the good performance of proposed Profit Sharing against the conventional Profit Sharing.
  • Keywords
    learning (artificial intelligence); probability; nondeterministic state transition rule; probabilistic state transition; profit sharing; reinforcement learning methods; Informatics; Learning systems; State estimation; Exploration and Exploitation; Profit Sharing; Reinforecement Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE, 2007 Annual Conference
  • Conference_Location
    Takamatsu
  • Print_ISBN
    978-4-907764-27-2
  • Electronic_ISBN
    978-4-907764-27-2
  • Type

    conf

  • DOI
    10.1109/SICE.2007.4421458
  • Filename
    4421458