Title :
Reinforcement learning approaches and evaluation criteria for opportunistic spectrum access
Author :
Robert, C. ; Moy, Christophe ; Cheng-Xiang Wang
Author_Institution :
SUPELEC/IETR, Cesson-Sévigné, France
Abstract :
This paper deals with the learning and decision making issue for cognitive radio (CR). Two reinforcement-learning algorithms proposed in the literature are compared for opportunistic spectrum access (OSA): Upper Confidence Bound (UCB) algorithm and Weight Driven (WD) algorithm. This paper also introduces two new metrics in order to evaluate the machine learning algorithm performance for CR: effective cumulative regret and percentage of successful trials. They provide a fair evaluation means for CR performance.
Keywords :
cognitive radio; learning (artificial intelligence); telecommunication computing; CR performance; OSA; UCB algorithm; WD algorithm; cognitive radio; effective cumulative percentage; effective cumulative regret; evaluation criteria; machine learning algorithm performance; opportunistic spectrum access; reinforcement learning approaches; upper confidence bound algorithm; weight driven algorithm; Cognitive radio; Decision support systems; Cognitive radio; MAB; UCB; machine learning; opportunistic spectrum access;
Conference_Titel :
Communications (ICC), 2014 IEEE International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/ICC.2014.6883535