DocumentCode
672453
Title
Introducing strategic measure actions in multi-armed bandits
Author
Boldrini, Stefano ; Fiorina, Jocelyn ; Di Benedetto, Maria-Gabriella
Author_Institution
Dept. of Inf. Eng., Electron. & Telecommun. (DIET), Sapienza Univ. of Rome, Rome, Italy
fYear
2013
fDate
8-9 Sept. 2013
Firstpage
41
Lastpage
45
Abstract
Multi-armed bandits may be used to model the process of selecting one among several wireless networks, given a set of system constraints typically formed by user-perceived network quality indicators. This work proposes a novel multi-armed bandit model, adapted to this context by introducing a distinction between two actions, to measure and to use, in order to better reflect real communication application scenarios. The impact of this distinction is analysed through simulations that compare a traditional multi-armed bandit algorithm against methods that integrate the new concept of measuring vs. using. Results show that performance in terms of regret can be significantly improved by the proposed algorithms if the period needed for measuring is at least 3 times shorter than the period needed for using. The classical method would require a significantly shorter measuring period to reach the same regret, i.e. much stricter constraints on the allowed duration of the measure action.
Keywords
probability; radio networks; multiarmed bandit algorithm; user-perceived network quality indicators; wireless network; Algorithm design and analysis; Analytical models; Gain measurement; Performance evaluation; Phase measurement; Time measurement; Wireless networks; Multi-armed bandit; UCB; exploitation; exploration; learning; regret; wireless network selection
fLanguage
English
Publisher
IEEE
Conference_Titel
Personal, Indoor and Mobile Radio Communications (PIMRC Workshops), 2013 IEEE 24th International Symposium on
Conference_Location
London
Type
conf
DOI
10.1109/PIMRCW.2013.6707833
Filename
6707833