مرکز منطقه ای اطلاع رساني علوم و فناوري - A Decision Task in a Social Context: Human Experiments, Models, and Analyses of Behavioral Data

DocumentCode :

1367325

Title :

A Decision Task in a Social Context: Human Experiments, Models, and Analyses of Behavioral Data

Author :

Nedic, Andrea ; Tomlin, Damon ; Holmes, Philip ; Prentice, Deborah A. ; Cohen, J.D.

Author_Institution :

Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA

Volume :

100

Issue :

fYear :

2012

fDate :

3/1/2012 12:00:00 AM

Firstpage :

713

Lastpage :

733

Abstract :

To investigate the influence of information about fellow group members in a constrained decision-making context, we develop four two-armed bandit tasks in which subjects freely select one of two options (A or B) and are informed of the resulting reward following each choice. Rewards are determined by the fraction x of past A choices by two functions f_A(x),f_B(x) (unknown to the subject) which intersect at a matching point x that does not generally represent globally optimal behavior. Playing individually, subjects typically remain close to the matching point, although some discover the optimum. Each task is designed to probe a different type of behavior, and subjects work in parallel in groups of five with feedback of other group members´ choices, of their rewards, of both, or with no knowledge of others´ behavior. We employ a soft-max choice model that emerges from a drift-diffusion process, commonly used to model perceptual decision making with noisy stimuli. Here the stimuli are replaced by estimates of expected rewards produced by a temporal-difference reinforcement-learning algorithm, augmented to include appropriate feedback terms. Models are fitted for each task and feedback condition, and we compare choice allocations averaged across subjects and individual choice sequences to highlight differences between tasks and intersubject differences. The most complex model, involving both choice and reward feedback, contains only four parameters, but nonetheless reveals significant differences in individual strategies. Strikingly, we find that rewards feedback can be either detrimental or advantageous to performance, depending upon the task.

Keywords :

behavioural sciences; data analysis; decision making; learning (artificial intelligence); social sciences; behavioral data analyses; choice allocations; choice sequences; constrained decision-making context; decision task; drift-diffusion process; feedback condition; human experiments; matching point; noisy stimuli; perceptual decision making model; social context; task condition; temporal-difference reinforcement-learning algorithm; two-armed bandit tasks; Analytical models; Behavioral science; Context modeling; Data models; Decision making; Human factors; Information processing; Resource management; Social factors; Decision making; drift-diffusion model; exploitation; exploration; group dynamics; human behavior; reinforcement learning; social information; two-armed bandit task;

fLanguage :

English

Journal_Title :

Proceedings of the IEEE

Publisher :

ieee

ISSN :

0018-9219

Type :

jour

DOI :

10.1109/JPROC.2011.2166437

Filename :

6069518

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1367325