DocumentCode :
2520323
Title :
Dynamic Spectrum Access in realistic environments using reinforcement learning
Author :
Myrvoll, Tor Andre ; Håkegård, Jan Erik
Author_Institution :
SINTEF ICT, Trondheim, Norway
fYear :
2012
fDate :
2-5 Oct. 2012
Firstpage :
465
Lastpage :
470
Abstract :
We study the use of reinforcement learning to model Dynamic Spectrum Access in a realistic multi-channel environment. Three different approaches from the literature on the multi-armed bandit problem are compared on a set of realistic channel access models - two are based on stochastic models of the channel occupancy, while a third assumes an adversarial model. The algorithms are experimentally tested on channels occupied by primary users that behave according to a simple fair scheduler and a semi-Markov model based on WLAN traffic measurements; models that generate more realistic channel occupancy patterns than allowed by fixed i.i.d. probability models. The experiments show that the UCB1 algorithm of Auer et. al. [1] outperforms the other algorithms, and we support these findings using some simple theoretical results.
Keywords :
Markov processes; learning (artificial intelligence); probability; radio spectrum management; scheduling; telecommunication computing; telecommunication traffic; wireless LAN; UCB1 algorithm; WLAN traffic measurements; adversarial model; dynamic spectrum access; fair scheduler; fixed i.i.d. probability models; multiarmed bandit problem; primary users; realistic channel access models; realistic channel occupancy patterns; realistic environments; realistic multichannel environment; reinforcement learning; semiMarkov model; stochastic models; Availability; Channel estimation; Channel models; Sensors; Stochastic processes; Switches; Wireless LAN;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technologies (ISCIT), 2012 International Symposium on
Conference_Location :
Gold Coast, QLD
Print_ISBN :
978-1-4673-1156-4
Electronic_ISBN :
978-1-4673-1155-7
Type :
conf
DOI :
10.1109/ISCIT.2012.6380943
Filename :
6380943
Link To Document :
بازگشت