Title :
Reinforcement learning in BitTorrent systems
Author :
Izhak-Ratzin, Rafit ; Park, Hyunggon ; Van der Schaar, Mihaela
Author_Institution :
Palo Alto Networks, Sunnyvale, CA, USA
Abstract :
In this paper, we propose a BitTorrent-like protocol that replaces the peer selection mechanisms in the regular BitTorrent protocol with a novel reinforcement learning based mechanism. The inherent operation of P2P systems, which involves repeated interactions among peers over a long time period, allows peers to efficiently identify free-riders as well as desirable collaborators by learning the behavior of their associated peers. Thus, it can help peers improve their download rates and discourage free-riding (FR), while improving fairness. We model the peers´ interactions in the BitTorrent-like network as a repeated interaction game, where we explicitly consider the strategic behavior of the peers. A peer that applies the reinforcement learning based mechanism uses a partial history of the observations on associated peers´ statistical reciprocal behaviors to determine its best responses and estimate the corresponding impact on its expected utility. The policy determines the peer´s resource reciprocations with other peers, which would maximize the peer´s long-term performance.
Keywords :
learning (artificial intelligence); peer-to-peer computing; protocols; P2P systems; bittorrent protocol; peer selection mechanisms; reinforcement learning; strategic behavior; Bandwidth; Games; History; Learning; Peer to peer computing; Protocols; Robustness; BitTorrent; P2P; reinforcement learning;
Conference_Titel :
INFOCOM, 2011 Proceedings IEEE
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9919-9
DOI :
10.1109/INFCOM.2011.5935192