مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate policy iteration with unsupervised feature learning based on manifold regularization

DocumentCode :

3661004

Title :

Approximate policy iteration with unsupervised feature learning based on manifold regularization

Author :

Hongliang Li;Derong Liu;Ding Wang

Author_Institution :

The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

In this paper, we develop a novel approximate policy iteration reinforcement learning algorithm with unsupervised feature learning based on manifold regularization. The proposed algorithm can automatically learn data-driven smooth basis representations for value function approximation, which can preserve the intrinsic geometry of the state space of Markov decision processes. Moreover, it can provide a direct basis extension for new samples in both policy learning and policy control processes. We evaluate the effectiveness and efficiency of the proposed algorithm on the inverted pendulum task. Simulation results show that this algorithm can learn smooth basis representations and excellent control policies.

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), 2015 International Joint Conference on

Electronic_ISBN :

2161-4407

Type :

conf

DOI :

10.1109/IJCNN.2015.7280311

Filename :

7280311

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3661004