DocumentCode :
672453
Title :
Introducing strategic measure actions in multi-armed bandits
Author :
Boldrini, Stefano ; Fiorina, Jocelyn ; Di Benedetto, Maria-Gabriella
Author_Institution :
Dept. of Inf. Eng., Electron. & Telecommun. (DIET), Sapienza Univ. of Rome, Rome, Italy
fYear :
2013
fDate :
8-9 Sept. 2013
Firstpage :
41
Lastpage :
45
Abstract :
Multi-armed bandits may be used for modelling the process of selecting one among different wireless networks, given a set of system constraints typically formed by user-perceived network quality indicators. This work proposes a novel multi-armed bandit, that is made appropriate to the above context by introducing a distinction between two actions, to measure and to use, in order to better reflect real communication application scenarios. The impact of this introduction is analysed through simulations by comparing a traditional multi-armed bandit algorithm against methods that integrate the new concept of measuring vs. using. Results show that performance in terms of regret can be significantly improved using the proposed algorithms if the period needed for measuring is at least 3 times shorter than the one for the using action. The classical method would require a significantly shorter measuring period to reach the same regret, i.e. much stricter constraints on the allowed measure action duration.
Keywords :
probability; radio networks; multiarmed bandit algorithm; user- perceived network quality indicators; wireless network; Algorithm design and analysis; Analytical models; Gain measurement; Performance evaluation; Phase measurement; Time measurement; Wireless networks; Multi-armed bandit; UCB; exploitation; exploration; learning; regret; wireless network selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Personal, Indoor and Mobile Radio Communications (PIMRC Workshops), 2013 IEEE 24th International Symposium on
Conference_Location :
London
Type :
conf
DOI :
10.1109/PIMRCW.2013.6707833
Filename :
6707833
Link To Document :
بازگشت