مرکز منطقه ای اطلاع رساني علوم و فناوري - Explore to see, learn to perceive, get the actions for free: SKILLABILITY

DocumentCode :

270753

Title :

Explore to see, learn to perceive, get the actions for free: SKILLABILITY

Author :

Kompella, Varan R. ; Stollenga, Marijn F. ; Luciw, Matthew D. ; Schmidhuber, JuÌˆrgen

Author_Institution :

IDSIA, Manno-Lugano, Switzerland

fYear :

2014

fDate :

6-11 July 2014

Firstpage :

2705

Lastpage :

2712

Abstract :

How can a humanoid robot autonomously learn and refine multiple sensorimotor skills as a byproduct of curiosity driven exploration, upon its high-dimensional unprocessed visual input? We present SKILLABILITY, which makes this possible. It combines the recently introduced Curiosity Driven Modular Incremental Slow Feature Analysis (Curious Dr. MISFA) with the well-known options framework. Curious Dr. MISFA´s objective is to acquire abstractions as quickly as possible. These abstractions map high-dimensional pixel-level vision to a low-dimensional manifold. We find that each learnable abstraction augments the robot´s state space (a set of poses) with new information about the environment, for example, when the robot is grasping a cup. The abstraction is a function on an image, called a slow feature, which can effectively discretize a high-dimensional visual sequence. For example, it maps the sequence of the robot watching its arm as it moves around, grasping randomly, then grasping a cup, and moving around some more while holding the cup, into a step function having two outputs: when the cup is or is not currently grasped. The new state space includes this grasped/not grasped information. Each abstraction is coupled with an option. The reward function for the option´s policy (learned through Least Squares Policy Iteration) is high for transitions that produce a large change in the step-functionlike slow features. This corresponds to finding bottleneck states, which are known good subgoals for hierarchical reinforcement learning - in the example, the subgoal corresponds to grasping the cup. The final skill includes both the learned policy and the learned abstraction. SKILLABILITY makes our iCub the first humanoid robot to learn complex skills such as to topple or grasp an object, from raw high-dimensional video input, driven purely by its intrinsic motivations.

Keywords :

humanoid robots; image sequences; learning (artificial intelligence); robot vision; SKILLABILITY; curiosity driven exploration; curiosity driven modular incremental slow feature analysis; high-dimensional pixel-level vision; high-dimensional visual sequence; humanoid robot; iCub robot; learnable abstraction; least squares policy iteration; low-dimensional manifold learning; reward function; sensorimotor skills; slow feature; Estimation error; Feature extraction; Grasping; Learning (artificial intelligence); Robots; Switches; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4799-6627-1

Type :

conf

DOI :

10.1109/IJCNN.2014.6889784

Filename :

6889784

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=270753