DocumentCode :
1299991
Title :
Fast Reinforcement Learning for Energy-Efficient Wireless Communication
Author :
Mastronarde, Nicholas ; Van der Schaar, Mihaela
Author_Institution :
State Univ. of New York at Buffalo, Buffalo, NY, USA
Volume :
59
Issue :
12
fYear :
2011
Firstpage :
6262
Lastpage :
6266
Abstract :
We consider the problem of energy-efficient point-to-point transmission of delay-sensitive data (e.g., multimedia data) over a fading channel. We propose a rigorous and unified framework for simultaneously utilizing both physical-layer and system-level techniques to minimize energy consumption, under delay constraints, in the presence of stochastic and unknown traffic and channel conditions. We formulate the problem as a Markov decision process and solve it online using reinforcement learning. The advantages of the proposed online method are that i) it does not require a priori knowledge of the traffic arrival and channel statistics to determine the jointly optimal physical-layer and system-level power management strategies; ii) it exploits partial information about the system so that less information needs to be learned than when using conventional reinforcement learning algorithms; and iii) it obviates the need for action exploration, which severely limits the adaptation speed and run-time performance of conventional reinforcement learning algorithms.
Keywords :
Markov processes; fading channels; learning (artificial intelligence); radiocommunication; telecommunication computing; telecommunication traffic; Markov decision process; channel statistics; delay constraints; delay-sensitive data; energy consumption; energy-efficient point-to-point transmission; energy-efficient wireless communication; fading channel; fast reinforcement learning algorithm; physical-layer; system-level power management strategy; system-level techniques; traffic arrival; Delay; Fading; Heuristic algorithms; Learning; Markov processes; Multimedia communication; Wireless communication; Energy-efficient wireless multimedia communication; Markov decision process; adaptive modulation and coding; dynamic power management; power-control; reinforcement learning;
fLanguage :
English
Journal_Title :
Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1053-587X
Type :
jour
DOI :
10.1109/TSP.2011.2165211
Filename :
5986747
Link To Document :
بازگشت