مرکز منطقه ای اطلاع رساني علوم و فناوري - Controlling the learning process of real-time heuristic search Original Research Article

Title of article :

Controlling the learning process of real-time heuristic search Original Research Article

Author/Authors :

Masashi Shimbo، نويسنده , , Toru Ishida، نويسنده ,

Issue Information :

روزنامه با شماره پیاپی سال 2003

Pages :

From page :

To page :

Abstract :

Real-time search provides an attractive framework for intelligent autonomous agents, as it allows us to model an agentʹs ability to improve its performance through experience. However, the behavior of real-time search agents is far from rational during the learning (convergence) process, in that they fail to balance the efforts to achieve a short-term goal (i.e., to safely arrive at a goal state in the present problem solving trial) and a long-term goal (to find better solutions through repeated trials). As a remedy, we introduce two techniques for controlling the amount of exploration, both overall and per trial. The weighted real-time search reduces the overall amount of exploration and accelerates convergence. It sacrifices admissibility but provides a nontrivial bound on the converged solution cost. The real-time search with upper bounds insures solution quality in each trial when the state space is undirected. These techniques result in a convergence process more stable compared with that of the Learning Real-Time A∗ algorithm.

Keywords :

Adaptive learning , Convergence process , Resource-boundedness , Real-time heuristic search , Rational agent

Journal title :

Artificial Intelligence

Serial Year :

2003

Journal title :

Artificial Intelligence

Record number :

1207259

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=1207259