DocumentCode :
2552398
Title :
Learning acceleration by policy sharing
Author :
Hwang, Kao-Shing ; Chen, Yu-Jen ; Jiang, Wei-Cheng
Author_Institution :
Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan
fYear :
2011
fDate :
21-25 June 2011
Firstpage :
725
Lastpage :
729
Abstract :
Reinforcement learning is one of the more prominent machine learning technologies due to its unsupervised learning structure and ability to continually learn, even in a dynamic operating environment. Applying this learning to cooperative multi-agent systems not only allows each individual agent to learn from its own experience, but also offers the opportunity for the individual agents to learn from the other agents in the system to increase the speed of learning can be accelerated. In the proposed learning algorithm, an agent store its experience in terms of state aggregation implemented with a decision tree, such that policy sharing between multi-agent is eventually accomplished by merging different decision trees between peers. Unlike lookup tables which have homogeneous structure for state aggregations, decision trees carried in agents are with heterogeneous structure. This work executes policy sharing between cooperative agents by means of forming a hyper structure from their trees instead of merging whole trees violently. The proposed scheme initially translates the whole decision tree from an agent to others. Based on the evidence, only partial leaf nodes hold helpful experience for policy sharing. The proposed method inducts a hyper decision tree by a great mount of samples which are sampled from the shared nodes. Results from simulations in multi-agent cooperative domain illustrate that the proposed algorithms perform better than the one without sharing.
Keywords :
decision trees; multi-agent systems; peer-to-peer computing; unsupervised learning; cooperative multi-agent systems; decision tree; dynamic operating environment; hyper structure; machine learning; policy sharing; reinforcement learning; shared nodes; unsupervised learning structure; Decision trees; Learning; Machine learning; Merging; Mobile robots; Multiagent systems; Cooperation; Mobile robot; Multi-agent; Reinforcement learning; Sharing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation (WCICA), 2011 9th World Congress on
Conference_Location :
Taipei
Print_ISBN :
978-1-61284-698-9
Type :
conf
DOI :
10.1109/WCICA.2011.5970609
Filename :
5970609
Link To Document :
بازگشت