مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptive Play Q-Learning with Initial Heuristic Approximation

DocumentCode :

2624283

Title :

Adaptive Play Q-Learning with Initial Heuristic Approximation

Author :

Burkov, Andriy ; Chaib-Draa, Brahim

Author_Institution :

Departement d´´informatique et de genie logiciel, Univ. Laval, Sainte-Foy, Que.

fYear :

2007

fDate :

10-14 April 2007

Firstpage :

1749

Lastpage :

1754

Abstract :

The problem of an effective coordination of multiple autonomous robots is one of the most important tasks of the modern robotics. In turn, it is well known that the learning to coordinate multiple autonomous agents in a multiagent system is one of the most complex challenges of the state-of-the-art intelligent system design. Principally, this is because of the exponential growth of the environment\´s dimensionality with the number of learning agents. This challenge is known as "curse of dimensionality", and relates to the fact that the dimensionality of the multiagent coordination problem is exponential in the number of learning agents, because each state of the system is a joint state of all agents and each action is a joint action composed of actions of each agent. In this paper, we address this problem for the restricted class of environments known as goal-directed stochastic games with action-penalty representation. We use a single-agent problem solution as a heuristic approximation of the agents\´ initial preferences and, by so doing, we restrict to a great extent the space of multiagent learning. We show theoretically the correctness of such an initialization, and the results of experiments in a well-known two-robot grid world problem show that there is a significant reduction of complexity of the learning process.

Keywords :

adaptive systems; intelligent robots; learning (artificial intelligence); learning systems; multi-robot systems; stochastic games; action-penalty representation; adaptive play Q-learning; goal-directed stochastic games; heuristic approximation; intelligent system design; learning agents; multiagent system; multiple autonomous robot coordination; single-agent problem; two-robot grid world problem; Autonomous agents; Intelligent agent; Intelligent robots; Intelligent systems; Multiagent systems; Orbital robotics; Robot kinematics; Robotics and automation; State-space methods; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2007 IEEE International Conference on

Conference_Location :

Roma

ISSN :

1050-4729

Print_ISBN :

1-4244-0601-3

Electronic_ISBN :

1050-4729

Type :

conf

DOI :

10.1109/ROBOT.2007.363575

Filename :

4209339

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2624283