DocumentCode :
3165152
Title :
Empirical evidence equilibria in stochastic games
Author :
Dudebout, N. ; Shamma, Jeff S.
Author_Institution :
Decision & Control Lab., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
5780
Lastpage :
5785
Abstract :
The framework of empirical evidence equilibrium (EEE) for stochastic games is developed in this paper. In a stochastic game, agents collectively influence the dynamic of the environment. In standard equilibria, each agent´s strategy is optimal with respect to its opponents´ strategies. Therefore, each strategy is the solution to a partially observable Markov decision process (POMDP). The following considerations motivate the notion of EEE. First, solutions to a POMDP can be prohibitively complex to compute and implement. Second, agents might not fully understand the environment´s dynamic. Third, standard equilibria do not accommodate different levels of bounded rationality among agents. Finally, reaching equilibrium in stochastic games has not been adequately addressed. In the EEE framework, each agent formulates a simple model of its opponents´ effects. It neglects that agents are mutually dependent through the environment and computes an optimal strategy associated with its model. The agents play their strategies against each other and make some observations. Agents are in EEE when the models are consistent with these empirical observations. In this paper, the notion of EEE is formalized and an existence result is established in a general setting. Relations with other equilibria, including mean field equilibria, are also presented. Finally, the learning of EEEs by simple adaptive processes is illustrated through simulation.
Keywords :
Markov processes; decision theory; learning (artificial intelligence); multi-agent systems; observability; stochastic games; POMDP; adaptive process; agent bounded rationality; empirical evidence equilibria; environment dynamics; learning; mean field equilibria; mutually dependent agents; opponent effect; opponent strategy; optimal agent strategy; optimal strategy; partially observable Markov decision process; stochastic games; Computational modeling; Face; Games; History; Markov processes; Monitoring;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6426118
Filename :
6426118
Link To Document :
بازگشت