DocumentCode :
664041
Title :
Active Bayesian perception and reinforcement learning
Author :
Lepora, N.F. ; Martinez-Hernandez, U. ; Pezzulo, Giovanni ; Prescott, T.J.
Author_Institution :
SCentRo, Univ. of Sheffield, Sheffield, UK
fYear :
2013
fDate :
3-7 Nov. 2013
Firstpage :
4735
Lastpage :
4740
Abstract :
In a series of papers, we have formalized an active Bayesian perception approach for robotics based on recent progress in understanding animal perception. However, an issue for applied robot perception is how to tune this method to a task, using: (i) a belief threshold that adjusts the speed-accuracy tradeoff; and (ii) an active control strategy for relocating the sensor e.g. to a preset fixation point. Here we propose that these two variables should be learnt by reinforcement from a reward signal evaluating the decision outcome. We test this claim with a biomimetic fingertip that senses surface curvature under uncertainty about contact location. Appropriate formulation of the problem allows use of multi-armed bandit methods to optimize the threshold and fixation point of the active perception. In consequence, the system learns to balance speed versus accuracy and sets the fixation point to optimize both quantities. Although we consider one example in robot touch, we expect that the underlying principles have general applicability.
Keywords :
Bayes methods; belief networks; biomimetics; intelligent robots; learning (artificial intelligence); optimisation; uncertainty handling; active Bayesian perception; active control strategy; animal perception; belief threshold; biomimetic fingertip; contact location; fixation point; multiarmed bandit method; reinforcement learning; reward signal evaluation; robot perception; sensor relocation; speed accuracy tradeoff adjustment; surface curvature sensing; threshold optimization; uncertainty handling; Accuracy; Bayes methods; Decision making; Learning (artificial intelligence); Tactile sensors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
ISSN :
2153-0858
Type :
conf
DOI :
10.1109/IROS.2013.6697038
Filename :
6697038
Link To Document :
بازگشت