DocumentCode
1976002
Title
Regret Minimization Under Partial Monitoring
Author
Cesa-Bianchi, Nicolò ; Lugosi, Gábor ; Stoltz, Gilles
Author_Institution
Dipartimento di Scienze dell´´Informazione, Università degli Studi di Milano, 20135 Milano, Italy, Email: cesa-bianchi@dsi.unimi.it
fYear
2006
fDate
13-17 March 2006
Firstpage
72
Lastpage
76
Abstract
We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number of game rounds goes to infinity. We prove a general lower bound for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan consistent player exists.
Keywords
Convergence; Feedback; H infinity control; Monitoring; Pricing; Yttrium;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Theory Workshop, 2006. ITW '06 Punta del Este. IEEE
Conference_Location
Punta del Este, Uruguay
Print_ISBN
1-4244-0035-X
Electronic_ISBN
1-4244-0036-8
Type
conf
DOI
10.1109/ITW.2006.1633784
Filename
1633784
Link To Document