A policy-improving system with a mixture probability and clustering distributions to unknown 3d-environments

Author

Phommasak, Uthai ; Kitakoshi, Daisuke ; Shioya, Hiroyuki

Author_Institution

Div. of Inf. & Electron., Muroran Inst. of Technol., Muroran, Japan

fYear

2013

fDate

4-6 Sept. 2013

Firstpage

381

Lastpage

386

Abstract

There are many proposed policy-improving system of Reinforcement Learning (RL) agents that effective in quickly adapting to environmental change by using many statistical methods, such as using a Mixture Model of Bayesian network, using Mixture Probability and Clustering Distribution, etc. However, by using a mixture model of Bayesian network, this system increase the computational complexity that make the control of the computational complexity becomes a necessary problem. On the other hand, by using mixture probability and clustering distribution, even though the computational complexity can be controlled and simultaneously maintain the system´s performance, the examination of computational complexity load and the adaptation performance to more complex environments such as 3D-environments are required. In this paper, we concentrate on the policy-improving system by using mixture probability and clustering distributions. We introduce new parameters and the modified reward process for experiments on 3D-environments, and then investigate and discuss the performance of our proposed system from the results.

Keywords

belief networks; computational complexity; learning (artificial intelligence); multi-agent systems; statistical distributions; Bayesian network; RL agents; clustering distributions; computational complexity; mixture probability; policy-improving system; reinforcement learning; statistical methods; Bayes methods; Computational complexity; Computational modeling; Computer science; DH-HEMTs; Joints; Statistical analysis; Clustering; Hellinger distance; Mixture Probability; Profit-sharing method; Reinforcement learning;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Engineering Conference (ICSEC), 2013 International

Conference_Location

Nakorn Pathom

Print_ISBN

978-1-4673-5322-9

Type

conf

DOI

10.1109/ICSEC.2013.6694813

Filename

6694813