• DocumentCode
    45716
  • Title

    Cooperation in wireless networks: a game-theoretic framework with reinforcement learning

  • Author

    Baidas, Mohammed W.

  • Author_Institution
    Electr. Eng. Dept., Kuwait Univ., Safat, Kuwait
  • Volume
    8
  • Issue
    5
  • fYear
    2014
  • fDate
    March 27 2014
  • Firstpage
    740
  • Lastpage
    753
  • Abstract
    A game-theoretic framework based on the iterated prisoner´s dilemma (IPD) is proposed to model the repeated dynamic interactions of multiple source nodes when communicating with multiple destinations in an ad hoc wireless network. In such networks where nodes are autonomous, selfish and not familiar with other nodes´ strategies, fully cooperative behaviours cannot be assumed. Therefore reinforcement learning is studied to relate the utility function of each source node to actions previously taken in order to learn a strategy that maximises their expected future reward. Particularly, a Q-learning algorithm is proposed to allow network nodes to adapt to and play the IPD game against opponents with a variety of known and unknown strategies. Simulation results illustrate that the proposed Q-learning algorithm allows network nodes to play optimally and achieve their maximum expected return values.
  • Keywords
    ad hoc networks; cooperative communication; game theory; iterative methods; learning (artificial intelligence); IPD game; Q-learning algorithm; ad hoc wireless network; cooperation communication; game theory; iterated prisoner dilemma; maximum expected return value; network node; reinforcement learning; repeated dynamic interaction; source node; utility function;
  • fLanguage
    English
  • Journal_Title
    Communications, IET
  • Publisher
    iet
  • ISSN
    1751-8628
  • Type

    jour

  • DOI
    10.1049/iet-com.2013.0817
  • Filename
    6777119