• DocumentCode
    661942
  • Title

    A policy-improving system with a mixture probability and clustering distributions to unknown 3d-environments

  • Author

    Phommasak, Uthai ; Kitakoshi, Daisuke ; Shioya, Hiroyuki

  • Author_Institution
    Div. of Inf. & Electron., Muroran Inst. of Technol., Muroran, Japan
  • fYear
    2013
  • fDate
    4-6 Sept. 2013
  • Firstpage
    381
  • Lastpage
    386
  • Abstract
    There are many proposed policy-improving system of Reinforcement Learning (RL) agents that effective in quickly adapting to environmental change by using many statistical methods, such as using a Mixture Model of Bayesian network, using Mixture Probability and Clustering Distribution, etc. However, by using a mixture model of Bayesian network, this system increase the computational complexity that make the control of the computational complexity becomes a necessary problem. On the other hand, by using mixture probability and clustering distribution, even though the computational complexity can be controlled and simultaneously maintain the system´s performance, the examination of computational complexity load and the adaptation performance to more complex environments such as 3D-environments are required. In this paper, we concentrate on the policy-improving system by using mixture probability and clustering distributions. We introduce new parameters and the modified reward process for experiments on 3D-environments, and then investigate and discuss the performance of our proposed system from the results.
  • Keywords
    belief networks; computational complexity; learning (artificial intelligence); multi-agent systems; statistical distributions; Bayesian network; RL agents; clustering distributions; computational complexity; mixture probability; policy-improving system; reinforcement learning; statistical methods; Bayes methods; Computational complexity; Computational modeling; Computer science; DH-HEMTs; Joints; Statistical analysis; Clustering; Hellinger distance; Mixture Probability; Profit-sharing method; Reinforcement learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Engineering Conference (ICSEC), 2013 International
  • Conference_Location
    Nakorn Pathom
  • Print_ISBN
    978-1-4673-5322-9
  • Type

    conf

  • DOI
    10.1109/ICSEC.2013.6694813
  • Filename
    6694813