مرکز منطقه ای اطلاع رساني علوم و فناوري - Algorithmic models of human decision making in Gaussian multi-armed bandit problems

DocumentCode :

189567

Title :

Algorithmic models of human decision making in Gaussian multi-armed bandit problems

Author :

Reverdy, Paul ; Srivastava, Vishnu ; Leonard, Naomi Ehrich

Author_Institution :

Dept. of Mech. & Aerosp. Eng., Princeton Univ., Princeton, NJ, USA

fYear :

2014

fDate :

24-27 June 2014

Firstpage :

2210

Lastpage :

2215

Abstract :

We consider a heuristic Bayesian algorithm as a model of human decision making in multi-armed bandit problems with Gaussian rewards. We derive a novel upper bound on the Gaussian inverse cumulative distribution function and use it to show that the algorithm achieves logarithmic regret. We extend the algorithm to allow for stochastic decision making using Boltzmann action selection with a dynamic temperature parameter and provide a feedback rule for tuning the temperature parameter such that the stochastic algorithm achieves logarithmic regret. The stochastic algorithm encodes many of the observed features of human decision making.

Keywords :

Bayes methods; Gaussian distribution; decision making; stochastic processes; Boltzmann action selection; Gaussian inverse cumulative distribution function; Gaussian multiarmed bandit problems; Gaussian rewards; algorithmic models; dynamic temperature parameter; feedback rule; heuristic Bayesian algorithm; human decision making; logarithmic regret; stochastic algorithm; stochastic decision making; upper bound; Algorithm design and analysis; Bayes methods; Decision making; Equations; Heuristic algorithms; Random variables; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control Conference (ECC), 2014 European

Conference_Location :

Strasbourg

Print_ISBN :

978-3-9524269-1-3

Type :

conf

DOI :

10.1109/ECC.2014.6862580

Filename :

6862580

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=189567