DocumentCode :
3165845
Title :
Towards optimization of a human-inspired heuristic for solving explore-exploit problems
Author :
Reverdy, Paul ; Wilson, Richard C. ; Holmes, Pat ; Leonard, Naomi Ehrich
Author_Institution :
Dept. of Mech. & Aerosp. Eng., Princeton Univ., Princeton, NJ, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
2820
Lastpage :
2825
Abstract :
Motivated by models of human decision making, we consider a heuristic solution for explore-exploit problems. In a numerical example we show that, with appropriate parameter values, the algorithm performs well. However, the parameters of the algorithm trade off exploration against exploitation in a complicated way so that finding the optimal parameter values is not obvious. We show that the optimal parameter values can be analytically computed in some cases and prove that suboptimal parameter tunings can provide robustness to modeling error. The analytic results suggest a feedback control law for dynamically optimizing parameters.
Keywords :
decision making; feedback; optimal control; optimisation; robust control; dynamically optimizing parameter; error modeling; explore-exploit problem; feedback control law; human decision making; human-inspired heuristic; optimal parameter value; optimization; robustness; suboptimal parameter tuning; Heuristic algorithms; Humans; Noise; Optimization; Switches; Tuning; USA Councils;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6426148
Filename :
6426148