مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2552398

Title :

Learning acceleration by policy sharing

Author :

Hwang, Kao-Shing ; Chen, Yu-Jen ; Jiang, Wei-Cheng

Author_Institution :

Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan

fYear :

2011

fDate :

21-25 June 2011

Firstpage :

725

Lastpage :

729

Abstract :

Reinforcement learning is one of the more prominent machine learning technologies due to its unsupervised learning structure and ability to continually learn, even in a dynamic operating environment. Applying this learning to cooperative multi-agent systems not only allows each individual agent to learn from its own experience, but also offers the opportunity for the individual agents to learn from the other agents in the system to increase the speed of learning can be accelerated. In the proposed learning algorithm, an agent store its experience in terms of state aggregation implemented with a decision tree, such that policy sharing between multi-agent is eventually accomplished by merging different decision trees between peers. Unlike lookup tables which have homogeneous structure for state aggregations, decision trees carried in agents are with heterogeneous structure. This work executes policy sharing between cooperative agents by means of forming a hyper structure from their trees instead of merging whole trees violently. The proposed scheme initially translates the whole decision tree from an agent to others. Based on the evidence, only partial leaf nodes hold helpful experience for policy sharing. The proposed method inducts a hyper decision tree by a great mount of samples which are sampled from the shared nodes. Results from simulations in multi-agent cooperative domain illustrate that the proposed algorithms perform better than the one without sharing.

Keywords :

decision trees; multi-agent systems; peer-to-peer computing; unsupervised learning; cooperative multi-agent systems; decision tree; dynamic operating environment; hyper structure; machine learning; policy sharing; reinforcement learning; shared nodes; unsupervised learning structure; Decision trees; Learning; Machine learning; Merging; Mobile robots; Multiagent systems; Cooperation; Mobile robot; Multi-agent; Reinforcement learning; Sharing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control and Automation (WCICA), 2011 9th World Congress on

Conference_Location :

Taipei

Print_ISBN :

978-1-61284-698-9

Type :

conf

DOI :

10.1109/WCICA.2011.5970609

Filename :

5970609

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2552398