DocumentCode :
663354
Title :
Learning while preventing mechanical failure due to random motions
Author :
Meijdam, H.J. ; Plooij, M.C. ; Caarls, Wouter
fYear :
2013
fDate :
3-7 Nov. 2013
Firstpage :
182
Lastpage :
187
Abstract :
Learning can be used to optimize robot motions to new situations. Learning motions can cause high frequency random motions in the exploration phase and can cause failure before the motion is learned. The mean time between failures (MTBF) of a robot can be predicted while it is performing these motions. The predicted MTBF in the exploration phase can be increased by filtering actions or possible actions of the algorithm. We investigated five algorithms that apply this filtering in various ways and compared them to SARSA(λ) learning. In general, increasing the MTBF decreases the learning performance. Three of the investigated algorithms are unable to increase the MTBF while keeping their learning performance approximately equal to SARSA(λ). Two algorithms are able to do this: the PADA algorithm and the low-pass filter algorithm. In case of LEO, a bipedal walking robot that tries to optimize a walking motion, the MTBF can be increased by a factor of 108 compared to SARSA(λ). This indicates that, in some cases, failures due to high frequency random motions can be prevented without decreasing the performance.
Keywords :
failure (mechanical); learning systems; legged locomotion; low-pass filters; motion control; optimal control; LEO; PADA algorithm; SARSA(λ) learning; action filtering; bipedal walking robot; exploration phase; high frequency random motion; learning performance; low-pass filter algorithm; mean time between failures; mechanical failure prevention; motion learning; robot MTBF; robot motion optimization; walking motion optimization; Approximation algorithms; Gears; Low earth orbit satellites; Markov processes; Robots; Stress; Torque;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
ISSN :
2153-0858
Type :
conf
DOI :
10.1109/IROS.2013.6696351
Filename :
6696351