مرکز منطقه ای اطلاع رساني علوم و فناوري - Robustness and generalization of model-free learning for robot kinematic control using a nested-hierarchical multi-agent topology

DocumentCode :

3180094

Title :

Robustness and generalization of model-free learning for robot kinematic control using a nested-hierarchical multi-agent topology

Author :

Karigiannis, John N. ; Tzafestas, Costas S.

Author_Institution :

Fac. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens (NTUA), Athens, Greece

fYear :

2012

fDate :

24-27 June 2012

Firstpage :

1140

Lastpage :

1147

Abstract :

This paper focuses on evaluating the robustness and knowledge generalization properties of a model-free learning mechanism, applied for the kinematic control of robot manipulation chains based on a nested-hierarchical multi-agent architecture. In the proposed topology, the agents correspond to independent degrees-of-freedom (DOF) of the system, managing to gain experience over the task that they collaboratively perform by continuously exploring and exploiting their state-to-action mapping space. Each agent forms a local (partial) view of the global system state and task progress, through a recursive learning process. By organizing the agents in a nested topology, the goal is to facilitate modular scaling to more complex kinematic topologies, with loose control coupling among the agents. Reinforcement learning is applied within each agent, to evolve a local state-to-action mapping in a continuous domain, thus leading to a system that exhibits developmental properties. This work addresses problem settings in the domain of kinematic control of dexterous-redundant robot manipulation systems. The numerical experiments performed consider the case of a single-linkage open kinematic chain, presenting kinematic redundancies given the desired task-goal. The focal issue in these experiments is to assess the capacity of the proposed multi-agent system to progressively and autonomously acquire cooperative sensorimotor skills through a self-learning process, that is, without the use of any explicit model-based planning strategy. In this paper, generalization and robustness properties of the overall multi-agent system are explored. Furthermore, the proposed framework is evaluated in constrained motion tasks, both in static and non-static environments. The computational cost of the proposed multi-agent architecture is also assessed.

Keywords :

dexterous manipulators; hierarchical systems; learning (artificial intelligence); multi-robot systems; redundant manipulators; topology; DOF; constrained motion tasks; cooperative sensorimotor skills; degrees-of-freedom; dexterous-redundant robot manipulation systems; kinematic redundancies; knowledge generalization properties; model-free learning generalization; model-free learning robustness; modular scaling; nested-hierarchical multiagent topology; recursive learning process; reinforcement learning; robot kinematic control; self-learning process; single-linkage open kinematic chain; state-to-action mapping space; Joints; Kinematics; Learning systems; Multiagent systems; Robots; Topology; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Biomedical Robotics and Biomechatronics (BioRob), 2012 4th IEEE RAS & EMBS International Conference on

Conference_Location :

Rome

ISSN :

2155-1774

Print_ISBN :

978-1-4577-1199-2

Type :

conf

DOI :

10.1109/BioRob.2012.6290276

Filename :

6290276

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3180094