Title :
Regret Minimization Under Partial Monitoring
Author :
Cesa-Bianchi, Nicolò ; Lugosi, Gábor ; Stoltz, Gilles
Author_Institution :
Dipartimento di Scienze dell´´Informazione, Università degli Studi di Milano, 20135 Milano, Italy, Email: cesa-bianchi@dsi.unimi.it
Abstract :
We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number of game rounds goes to infinity. We prove a general lower bound for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan consistent player exists.
Keywords :
Convergence; Feedback; H infinity control; Monitoring; Pricing; Yttrium;
Conference_Titel :
Information Theory Workshop, 2006. ITW '06 Punta del Este. IEEE
Conference_Location :
Punta del Este, Uruguay
Print_ISBN :
1-4244-0035-X
Electronic_ISBN :
1-4244-0036-8
DOI :
10.1109/ITW.2006.1633784