DocumentCode :
739544
Title :
Stochastic Power Adaptation with Multiagent Reinforcement Learning for Cognitive Wireless Mesh Networks
Author :
Xianfu Chen ; Zhifeng Zhao ; Honggang Zhang
Author_Institution :
VTT Tech. Res. Centre of Finland, Oulu, Finland
Volume :
12
Issue :
11
fYear :
2013
Firstpage :
2155
Lastpage :
2166
Abstract :
As the scarce spectrum resource is becoming overcrowded, cognitive radio indicates great flexibility to improve the spectrum efficiency by opportunistically accessing the authorized frequency bands. One of the critical challenges for operating such radios in a network is how to efficiently allocate transmission powers and frequency resource among the secondary users (SUs) while satisfying the quality-of-service constraints of the primary users. In this paper, we focus on the noncooperative power allocation problem in cognitive wireless mesh networks formed by a number of clusters with the consideration of energy efficiency. Due to the SUs´ dynamic and spontaneous properties, the problem is modeled as a stochastic learning process. We first extend the single-agent Q-learning to a multiuser context, and then propose a conjecture-based multiagent Q-learning algorithm to achieve the optimal transmission strategies with only private and incomplete information. An intelligent SU performs Q-function updates based on the conjecture over the other SUs´ stochastic behaviors. This learning algorithm provably converges given certain restrictions that arise during the learning procedure. Simulation experiments are used to verify the performance of our algorithm and demonstrate its effectiveness of improving the energy efficiency.
Keywords :
cognitive radio; learning (artificial intelligence); multi-agent systems; stochastic processes; telecommunication computing; telecommunication power management; wireless mesh networks; SU stochastic behaviors; authorized frequency bands; cognitive wireless mesh networks; conjecture based multiagent Q-learning algorithm; frequency resource; multiagent reinforcement learning; multiuser context; optimal transmission strategies; power allocation problem; quality-of-service constraints; scarce spectrum resource; secondary users; single agent Q-learning; spectrum efficiency; stochastic power adaptation; transmission powers; Algorithm design and analysis; Games; Interference; Resource management; Signal to noise ratio; Stochastic processes; Wireless communication; Cognitive radio; algorithm/protocol design and analysis; reinforcement learning; resource allocation;
fLanguage :
English
Journal_Title :
Mobile Computing, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1233
Type :
jour
DOI :
10.1109/TMC.2012.178
Filename :
6265055
Link To Document :
بازگشت