مرکز منطقه ای اطلاع رساني علوم و فناوري - Implementation of Q — Learning algorithm for solving maze problem

DocumentCode :

3642752

Title :

Implementation of Q — Learning algorithm for solving maze problem

Author :

D. Osmanković;S. Konjicija

Author_Institution :

Department of Automatic Control and Electronics, University in Sarajevo, Faculty of Electrical Engineering in Sarajevo, Zmaja od Bosne bb, 71000 Sarajevo, Bosnia and Herzegovina

fYear :

2011

fDate :

5/1/2011 12:00:00 AM

Firstpage :

1619

Lastpage :

1622

Abstract :

Machine learning is very important in several fields ranging from control systems to data mining. This paper presents Q - Learning implementation for abstract graph models with maze solving (finding the trajectory out of the maze) taken as example of graph problem. The paper consists of conversion of maze matrix to Q - Learning reward matrix, and also the implementation of Q - Learning algorithm for the reward matrix (similar to minimizing criteria matrix in dynamic programming). This implementation is on higher level of abstraction, so other representations can be used (artificial neural networks, tree etc.). For the testing of Q - Learning algorithm, maze solving problem was visualized in MATLAB programming language with the found trajectory marked on the maze. The maze in this paper is defined with starting position in the top left corner and the exit in the bottom right corner. The performance of the algorithm is measured for different scales of the problem.

Publisher :

ieee

Conference_Titel :

MIPRO, 2011 Proceedings of the 34th International Convention

Print_ISBN :

978-1-4577-0996-8

Type :

conf

Filename :

5967320

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3642752