مرکز منطقه ای اطلاع رساني علوم و فناوري - Hybrid learning strategy to solve pendulum swing-up problem for real hardware

DocumentCode :

3454407

Title :

Hybrid learning strategy to solve pendulum swing-up problem for real hardware

Author :

Nakamura, Shingo ; Hashimoto, Shuji

Author_Institution :

Dept. of Appl. Phys., Waseda Univ., Tokyo

fYear :

2007

fDate :

15-18 Dec. 2007

Firstpage :

1972

Lastpage :

1977

Abstract :

In this paper, we propose a machine learning strategy to obtain the optimal controller for actual machine using hybrid platforms; real hardware and simulator. A simulator consists of the neural networks which directly can learn actual behaviors of the latest hardware and emulates them without physical modeling. On the other hand, the controller of the hardware is trained with the simulator by the reinforcement learning method to realize the optimal control for the target task, and applied to the real hardware. Then, as long as the iteration of these processes is simultaneously performed, the system can automatically generate the optimal controller without any works even when hardware constitution is changed or switched. In this manner, the real hardware and the simulator affect each other to make the system adaptable. Furthermore, in the processes of sampling and supplying hardware data, we put a buffering component. It keeps the latest data of the hardware and supplies non-biased data to the simulator. As an example of the proposal method, we pick up the pendulum swing-up problem. In the experiments, firstly, the optimization process performs step by step for the initial hardware constitution and the basic idea of the method is evaluated. Afterward, by changing a pendulum, we confirm system can autonomously generate the new optimal controller for the real hardware without any human operations.

Keywords :

buffer storage; control engineering computing; learning (artificial intelligence); neurocontrollers; optimal control; robots; buffering component; hybrid learning; machine learning; neural network; optimal control; optimization; pendulum swing-up problem; real hardware; reinforcement learning; robots; Automatic control; Automatic generation control; Constitution; Control systems; Machine learning; Neural network hardware; Neural networks; Optimal control; Proposals; Sampling methods; Machine learning; Pendulum swing-up problem; Simulator construction;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Biomimetics, 2007. ROBIO 2007. IEEE International Conference on

Conference_Location :

Sanya

Print_ISBN :

978-1-4244-1761-2

Electronic_ISBN :

978-1-4244-1758-2

Type :

conf

DOI :

10.1109/ROBIO.2007.4522469

Filename :

4522469

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3454407