DocumentCode :
3256622
Title :
Distributing rewards by strategic knowledge based on Nash-Q learning
Author :
Igoshi, Kazuo ; Miura, Takao ; Shioya, Isamu
Author_Institution :
Dept.of Electr. & Electr. Eng., Hosei Univ., Koganei
fYear :
2008
fDate :
4-6 Aug. 2008
Firstpage :
458
Lastpage :
463
Abstract :
In this investigation, we examine collaboration approach to reward distribution in repeated general-sum stochastic games by multiple game players in terms of position and rewards. There have been several investigation of reward distribution discussed so far, and reinforcement has been considered useful since no knowledge is needed in advanced and better decision can be extracted while learning. Among others, Q-learning has been paid much attention under single agent environment. However, under multi-agent environment, we donpsilat have sharp targets to this problem, what is the most optimal principle? In this work, we discuss how to distribute reward thoroughly by considering as general stochastic games based on theory of games. That is, we introduce Nash-Q approach which combines Nash equilibrium with Q-learning. We show the new approach provides us with new strategic solution. We discuss some experiments of rather complicated games (game of life) to see the usefulness of the approach.
Keywords :
learning (artificial intelligence); stochastic games; Nash equilibrium; Nash-Q learning; repeated general-sum stochastic games; reward distribution; strategic knowledge; Collaboration; Game theory; Nash equilibrium; Stochastic processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Digital Information and Web Technologies, 2008. ICADIWT 2008. First International Conference on the
Conference_Location :
Ostrava
Print_ISBN :
978-1-4244-2623-2
Electronic_ISBN :
978-1-4244-2624-9
Type :
conf
DOI :
10.1109/ICADIWT.2008.4664393
Filename :
4664393
Link To Document :
بازگشت