مرکز منطقه ای اطلاع رساني علوم و فناوري - Control of nonholonomic mobile robot by an adaptive actor-critic method with simulated experience based value-functions

DocumentCode :

1886281

Title :

Control of nonholonomic mobile robot by an adaptive actor-critic method with simulated experience based value-functions

Author :

Syam, Rafiuddin ; Watanabe, Keigo ; Izumi, Kiyotaka ; Kiguchi, Kazuo

Author_Institution :

Dept. of Adv. Syst. Control Eng., Saga Univ., Japan

Volume :

fYear :

2002

fDate :

2002

Firstpage :

3960

Abstract :

An adaptive actor-critic algorithm is proposed under the assumption that a predictive model is available and only the measurement at time k is used to update the learning algorithms. Two value-functions are realized as a pure static mapping, according to the fact that they can be reduced to nonlinear current estimators, which can be easily constructed by using any artificial neural networks (NNs) with sigmoidal function or radial basis function (RBF), if all the inputs to the present value-functions are based on simulated experiences generated from the predictive model. In addition, if a predictive model is assumed to be used to construct a model-based actor (MBA) in the framework of adaptive actor-critic approach, then this type of MBA can be viewed as a network whose connection weights are composed of the elements of feedback gain matrix, so that the temporal difference (TD) learning can also be naturally applied to update the weights of the actor. Since the present method can update the learning by using only one measurement at time k, a relatively fast learning is expected, compared with the previous approach that needs two measurements at times k and k + 1 to update the actor-critic networks. The effectiveness of the proposed approach is illustrated by simulating a trajectory-tracking control problem for a nonholonomic mobile robot.

Keywords :

feedback; learning (artificial intelligence); mobile robots; radial basis function networks; robot kinematics; adaptive actor-critic method; artificial neural networks; feedback gain matrix; kinematic model; nonholonomic mobile robot; nonlinear current estimators; pure static mapping; radial basis function; sigmoidal function; simulated experience based value-functions; temporal difference learning; trajectory-tracking control problem; Adaptive control; Artificial neural networks; Control engineering; Decision making; Kinematics; Mobile robots; Neural networks; Predictive models; Programmable control; Time measurement;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2002. Proceedings. ICRA '02. IEEE International Conference on

Print_ISBN :

0-7803-7272-7

Type :

conf

DOI :

10.1109/ROBOT.2002.1014349

Filename :

1014349

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1886281