DocumentCode :
1792056
Title :
A multi-stage approach for efficiently learning humanoid robot stand-up behavior
Author :
Dingsheng Luo ; Yaoxiang Ding ; Zidong Cao ; Xihong Wu
Author_Institution :
Key Lab. of Machine Perception, Peking Univ., Beijing, China
fYear :
2014
fDate :
3-6 Aug. 2014
Firstpage :
884
Lastpage :
889
Abstract :
Stand-up motion is among the most essential behaviors for humanoid robots. For achieving stable stand-up behavior, the traditional key-frame based motion planning methods are time-exhausted and expert knowledge dependent. On the other hand, classic trial-and-error based learning methods are inefficient due to the high degrees of freedom (DOFs) for humanoid robots and the difficulty in fixing appropriate reward functions. In this paper, a multi-stage learning approach is proposed to address the above issues. At the first stage, under a trajectory based motion control model, key motion frames sampled from human motion capture data (HMCD) are used for model initialization, through which the solution space could be pruned. At the second stage, the design of experiments (DOE) technique is introduced for fast and active searching in the pruned solution space. At the last stage, a refining process that adopts a stochastic gradient learning strategy is performed to achieve the final behavior. Under this three-stage learning framework, along with a simple heuristic reward function, the learning of the stand-up behavior for a kid-size humanoid robot is fulfilled successfully and efficiently.
Keywords :
control engineering computing; design of experiments; gradient methods; humanoid robots; intelligent robots; learning (artificial intelligence); motion control; path planning; design of experiments technique; heuristic reward function; human motion capture data; key-frame based motion planning method; learning humanoid robot stand-up behavior; model initialization; multistage approach; multistage learning approach; refining process; stand-up motion; stochastic gradient learning strategy; three-stage learning framework; trial-and-error based learning method; Humanoid robots; Joints; Linear programming; Optimization; Stochastic processes; Trajectory; Design of experiments; Humanoid robot; Imitation learning; Stand-up behavior; Stochastic gradient optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronics and Automation (ICMA), 2014 IEEE International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-1-4799-3978-7
Type :
conf
DOI :
10.1109/ICMA.2014.6885814
Filename :
6885814
Link To Document :
بازگشت