Title :
Behavior Cloning by a Self-Organizing Decision Tree
Author :
Hwang, Kao-Shing ; Chen, Yu-Jen ; Yang, Tsan-Hui
Author_Institution :
Chung Cheng Univ., Chiayi
Abstract :
It is hard to define a state space or the proper reward function in reinforcement learning to make the robot act as expected. In this paper, we demonstrate the expected behavior for a robot Then a RL-based decision tree approach which decides to split according to long-term evaluations, instead of a top-down greedy strategy which finds out the relationship between the input and output from the demonstration data. We use this method to teach a robot for target seeking problem. In order to promote the performance in tackling target seeking problem, we add a Q-learning along with the state space based on RL-based decision tree. The experiment result shows that Q-Iearning can promote the performance quickly. For demonstration, we build a mobile robot powered by an embedded board. The robot can detect the hall of the range in any direction with omni-directional vision system. With such powerful embedded computing capability and the efficient machine vision system, the robot can inherit the learned behavior from a simulator which has learned the empirical behavior and continue to learn with Q-learning to improve the performance of target seeking problem.
Keywords :
decision trees; learning (artificial intelligence); mobile robots; robot vision; RL-based self-organizing decision tree; behavior cloning; embedded board; machine vision system; mobile robot; omni-directional vision system; reinforcement Q-learning; robot; state space; target seeking problem; top-down greedy strategy; Cloning; Decision trees; Educational robots; Embedded computing; Learning; Machine vision; Mobile robots; Orbital robotics; Robot vision systems; State-space methods; machine learning; robotics;
Conference_Titel :
Integration Technology, 2007. ICIT '07. IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
1-4244-1092-4
Electronic_ISBN :
1-4244-1092-4
DOI :
10.1109/ICITECHNOLOGY.2007.4290416