Title :
Model-Based and Learning-Based Decision Making in Incomplete Information Cournot Games: A State Estimation Approach
Author :
Kebriaei, Hamed ; Rahimi-Kian, Ashkan ; Ahmadabadi, Majid Nili
Author_Institution :
Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
Abstract :
In an incomplete information game, a big challenge is to find the best way of exploiting available information for optimal decision making of the agents. In this paper, two decision making methods, namely model-based and learning-based bidding strategies, are proposed and compared, for repeated Cournot competition of the generators in a day-ahead electricity market. The sum of the rivals´ offered quantities (SROQ) is considered as the state of the agent and its value is estimated using an adaptive expectation method. In the model-based approach, the convergence of the agents´ strategies to the Nash equilibrium point is also studied in two different cases. In the learning-based approach, the optimal bidding strategy is learned through combination of state estimation and a reinforcement learning method. Using the estimated state (SROQ), the optimal decision is learned through a fuzzy Q-learning algorithm. Through a case study, which is performed on the three-bus benchmark Cournot model, the convergence of the generators´ bids to the Nash-Cournot equilibrium is examined.
Keywords :
decision making; fuzzy set theory; game theory; learning (artificial intelligence); power engineering computing; power markets; Nash-Cournot equilibrium; SROQ; adaptive expectation method; day-ahead electricity market; fuzzy Q-learning algorithm; incomplete information Cournot game; learning-based decision making; model-based decision making; optimal bidding strategy; reinforcement learning; state estimation approach; sum of the rivals offered quantities; Computational modeling; Decision making; Eigenvalues and eigenfunctions; Games; Generators; Jacobian matrices; Mathematical model; Cournot oligopoly; fuzzy Q-learning (FQL); learning; repeated game; state estimation;
Journal_Title :
Systems, Man, and Cybernetics: Systems, IEEE Transactions on
DOI :
10.1109/TSMC.2014.2373336