• DocumentCode
    1976002
  • Title

    Regret Minimization Under Partial Monitoring

  • Author

    Cesa-Bianchi, Nicolò ; Lugosi, Gábor ; Stoltz, Gilles

  • Author_Institution
    Dipartimento di Scienze dell´´Informazione, Università degli Studi di Milano, 20135 Milano, Italy, Email: cesa-bianchi@dsi.unimi.it
  • fYear
    2006
  • fDate
    13-17 March 2006
  • Firstpage
    72
  • Lastpage
    76
  • Abstract
    We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number of game rounds goes to infinity. We prove a general lower bound for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan consistent player exists.
  • Keywords
    Convergence; Feedback; H infinity control; Monitoring; Pricing; Yttrium;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Theory Workshop, 2006. ITW '06 Punta del Este. IEEE
  • Conference_Location
    Punta del Este, Uruguay
  • Print_ISBN
    1-4244-0035-X
  • Electronic_ISBN
    1-4244-0036-8
  • Type

    conf

  • DOI
    10.1109/ITW.2006.1633784
  • Filename
    1633784