• DocumentCode
    2520323
  • Title

    Dynamic Spectrum Access in realistic environments using reinforcement learning

  • Author

    Myrvoll, Tor Andre ; Håkegård, Jan Erik

  • Author_Institution
    SINTEF ICT, Trondheim, Norway
  • fYear
    2012
  • fDate
    2-5 Oct. 2012
  • Firstpage
    465
  • Lastpage
    470
  • Abstract
    We study the use of reinforcement learning to model Dynamic Spectrum Access in a realistic multi-channel environment. Three different approaches from the literature on the multi-armed bandit problem are compared on a set of realistic channel access models - two are based on stochastic models of the channel occupancy, while a third assumes an adversarial model. The algorithms are experimentally tested on channels occupied by primary users that behave according to a simple fair scheduler and a semi-Markov model based on WLAN traffic measurements; models that generate more realistic channel occupancy patterns than allowed by fixed i.i.d. probability models. The experiments show that the UCB1 algorithm of Auer et. al. [1] outperforms the other algorithms, and we support these findings using some simple theoretical results.
  • Keywords
    Markov processes; learning (artificial intelligence); probability; radio spectrum management; scheduling; telecommunication computing; telecommunication traffic; wireless LAN; UCB1 algorithm; WLAN traffic measurements; adversarial model; dynamic spectrum access; fair scheduler; fixed i.i.d. probability models; multiarmed bandit problem; primary users; realistic channel access models; realistic channel occupancy patterns; realistic environments; realistic multichannel environment; reinforcement learning; semiMarkov model; stochastic models; Availability; Channel estimation; Channel models; Sensors; Stochastic processes; Switches; Wireless LAN;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technologies (ISCIT), 2012 International Symposium on
  • Conference_Location
    Gold Coast, QLD
  • Print_ISBN
    978-1-4673-1156-4
  • Electronic_ISBN
    978-1-4673-1155-7
  • Type

    conf

  • DOI
    10.1109/ISCIT.2012.6380943
  • Filename
    6380943