مرکز منطقه ای اطلاع رساني علوم و فناوري - A fast learning variable lambda TD model: Used to realize home aware robot navigation

DocumentCode :

1798224

Title :

A fast learning variable lambda TD model: Used to realize home aware robot navigation

Author :

Altahhan, Abdulrahman

Author_Institution :

Coll. of Comput. Inf. Technol., American Univ. in the Emirates, United Arab Emirates

fYear :

2014

fDate :

6-11 July 2014

Firstpage :

1534

Lastpage :

1541

Abstract :

This work describes a fast learning robot goal-aware navigation model that employs both gradient and conjugate gradient Temporal Difference (TD, TD-conj) methods. It builds on the fact that TD-conj was proven to be equivalent to a gradient TD method with a variable lambda under certain conditions. Based on straightforward features extraction process combined with goal-aware capabilities provided by whole image measure, the model solves what we call u-turn-homing benchmark problem without using landmarks. Only one goal snapshot was used with agent facing the goal directly. Therefore a novel threshold stopping formula was used to recognize the goal which is less sensitive to the agent-goal orientation problem. Unlike other models, this model refrains from artificially manipulating or assuming a priori knowledge about the environment, two constraints that widely restrict the applicability of existing models in realistic scenarios. An on-line control method was used to train a set of neural networks. With the aid of variable and fixed eligibility traces, these networks approximate the agent´s action-value function allowing it to take close to optimal actions to reach its home. The effectiveness of the model was experimentally verified on an agent.

Keywords :

conjugate gradient methods; learning (artificial intelligence); path planning; robots; agent action-value function; agent-goal orientation problem; conjugate gradient temporal difference method; fast learning variable lambda TD model; feature extraction process; home aware robot navigation; image measure; robot goal-aware navigation model; temporal difference model; threshold stopping formula; u-turn-homing benchmark problem; Approximation algorithms; Function approximation; Histograms; Navigation; Robots; Vectors; Home Aware; Orientation Insensitive Thersholding; TD-conj; U-Turn-Homin; Variable 1 TD;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location :

Beijing

Print_ISBN :

978-1-4799-6627-1

Type :

conf

DOI :

10.1109/IJCNN.2014.6889845

Filename :

6889845

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1798224