مرکز منطقه ای اطلاع رساني علوم و فناوري - Policy gradient reinforcement learning for fast quadrupedal locomotion

DocumentCode :

3597306

Title :

Policy gradient reinforcement learning for fast quadrupedal locomotion

Author :

Kohl, Nate ; Stone, Peter

Author_Institution :

Dept. of Comput. Sci., Texas Univ., Austin, TX, USA

Volume :

fYear :

2004

Firstpage :

2619

Abstract :

This paper presents a machine learning approach to optimizing a quadrupedal trot gait for forward speed. Given a parameterized walk designed for a specific robot, we propose using a form of policy gradient reinforcement learning to automatically search the set of possible parameters with the goal of finding the fastest possible walk. We implement and test our approach on a commercially available quadrupedal robot platform, namely the Sony Aibo robot. After about three hours of learning, all on the physical robots and with no human intervention other than to change the batteries, the robots achieved a gait faster than any previously known gait known for the Aibo, significantly outperforming a variety of existing hand-coded and learned solutions.

Keywords :

gradient methods; learning (artificial intelligence); legged locomotion; Sony Aibo robot; fast quadrupedal locomotion; machine learning; policy gradient reinforcement learning; quadrupedal trot gait; Friction; Hardware; Humans; Leg; Legged locomotion; Machine learning; Robot control; Robotics and automation; Stability; Testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2004. Proceedings. ICRA '04. 2004 IEEE International Conference on

ISSN :

1050-4729

Print_ISBN :

0-7803-8232-3

Type :

conf

DOI :

10.1109/ROBOT.2004.1307456

Filename :

1307456

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3597306