Title :
Conjectural variations in multi-agent reinforcement learning for energy-efficient cognitive wireless mesh networks
Author :
Chen, Xianfu ; Zhao, Zhifeng ; Zhang, Honggang ; Chen, Tao
Author_Institution :
York-Zhejiang Lab. for Cognitive Radio & Green Commun., Zhejiang Univ., Hangzhou, China
Abstract :
As energy saving and environmental protection become an inevitable trend, researchers need to shift their focus to “green” oriented architecture design. Recent advances in the area of cognitive radio (CR) have significant potential towards “green” communications. One of the critical challenges for operating CRs in a wireless mesh network is how to efficiently allocate transmission powers and frequency resource among the secondary users (SUs) while satisfying the quality-of-service constraints of primary users. Due to the SUs´ intelligent and selfish properties, this paper focuses on the non-cooperative spectrum sharing in cognitive wireless mesh networks formed by a number of clusters. In order to study the competition behaviors of SUs in a dynamic environment, the problem is modeled as a stochastic learning process. We first extend the single-agent reinforcement learning (RL) to a multi-user context, based on which a conjecture based multi-agent RL algorithm is proposed. A rational SU learns the optimal transmission strategy from the conjecture over the other SUs´ responses.
Keywords :
carrier transmission on power lines; cognitive radio; energy conservation; environmental factors; learning (artificial intelligence); multi-agent systems; quality of service; radio spectrum management; stochastic processes; telecommunication computing; wireless mesh networks; SU intelligent properties; SU responses; SU selfish properties; cognitive radio; competition behaviors; conjectural variations; conjecture-based multiagent RL algorithm; dynamic environment; energy-efficient cognitive wireless mesh networks; frequency resource; green communications; green oriented architecture design; multiagent reinforcement learning; multiuser context; noncooperative spectrum sharing; operating CR; optimal transmission strategy; quality-of-service constraints; rational SU; secondary users; single-agent reinforcement learning; stochastic learning process; transmission powers allocation; Games; Heuristic algorithms; Interference; Learning; Receivers; Resource management; Signal to noise ratio;
Conference_Titel :
Wireless Communications and Networking Conference (WCNC), 2012 IEEE
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0436-8
DOI :
10.1109/WCNC.2012.6214485