Title :
Robustness and generalization of model-free learning for robot kinematic control using a nested-hierarchical multi-agent topology
Author :
Karigiannis, John N. ; Tzafestas, Costas S.
Author_Institution :
Fac. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens (NTUA), Athens, Greece
Abstract :
This paper focuses on evaluating the robustness and knowledge generalization properties of a model-free learning mechanism, applied for the kinematic control of robot manipulation chains based on a nested-hierarchical multi-agent architecture. In the proposed topology, the agents correspond to independent degrees-of-freedom (DOF) of the system, managing to gain experience over the task that they collaboratively perform by continuously exploring and exploiting their state-to-action mapping space. Each agent forms a local (partial) view of the global system state and task progress, through a recursive learning process. By organizing the agents in a nested topology, the goal is to facilitate modular scaling to more complex kinematic topologies, with loose control coupling among the agents. Reinforcement learning is applied within each agent, to evolve a local state-to-action mapping in a continuous domain, thus leading to a system that exhibits developmental properties. This work addresses problem settings in the domain of kinematic control of dexterous-redundant robot manipulation systems. The numerical experiments performed consider the case of a single-linkage open kinematic chain, presenting kinematic redundancies given the desired task-goal. The focal issue in these experiments is to assess the capacity of the proposed multi-agent system to progressively and autonomously acquire cooperative sensorimotor skills through a self-learning process, that is, without the use of any explicit model-based planning strategy. In this paper, generalization and robustness properties of the overall multi-agent system are explored. Furthermore, the proposed framework is evaluated in constrained motion tasks, both in static and non-static environments. The computational cost of the proposed multi-agent architecture is also assessed.
Keywords :
dexterous manipulators; hierarchical systems; learning (artificial intelligence); multi-robot systems; redundant manipulators; topology; DOF; constrained motion tasks; cooperative sensorimotor skills; degrees-of-freedom; dexterous-redundant robot manipulation systems; kinematic redundancies; knowledge generalization properties; model-free learning generalization; model-free learning robustness; modular scaling; nested-hierarchical multiagent topology; recursive learning process; reinforcement learning; robot kinematic control; self-learning process; single-linkage open kinematic chain; state-to-action mapping space; Joints; Kinematics; Learning systems; Multiagent systems; Robots; Topology; Training;
Conference_Titel :
Biomedical Robotics and Biomechatronics (BioRob), 2012 4th IEEE RAS & EMBS International Conference on
Conference_Location :
Rome
Print_ISBN :
978-1-4577-1199-2
DOI :
10.1109/BioRob.2012.6290276