Title :
Optimal Scheduling over Time-Varying Channels with Traffic Admission Control: Structural Results and Online Learning Algorithms
Author :
Phan, Khoa T. ; Tho Le-Ngoc ; Van der Schaar, Mihaela ; Fangwen Fu
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
This work studies the joint scheduling- admission control (SAC) problem for a single user over a fading channel. Specifically, the SAC problem is formulated as a constrained Markov decision process (MDP) to maximize a utility defined as a function of the throughput and queue size. The optimal throughput- queue size trade-off is investigated. Optimal policies and their structural properties (i.e., monotonicity and convexity) are derived for two models: simultaneous and sequential scheduling and admission control actions. Furthermore, we propose online learning algorithms for the optimal policies for the two models when the statistical knowledge of the time-varying traffic arrival and channel processes is unknown. The analysis and algorithm development are relied on the reformulation of the Bellman´s optimality equations using suitably defined state-value functions which can be learned online, at transmission time, using time-averaging. The learning algorithms require less complexity and converge faster than the conventional Q-learning algorithms. This work also builds a connection between the MDP based formulation and the Lyapunov optimization based formulation for the SAC problem. Illustrative results demonstrate the performance of the proposed algorithms in various settings.
Keywords :
Markov processes; fading channels; telecommunication congestion control; time-varying channels; Bellman optimality equation; Lyapunov optimization based formulation; MDP based formulation; Markov decision process; Q-learning algorithm; SAC problem; fading channel; joint scheduling-admission control problem; online learning algorithm; optimal throughput-queue size trade-off; state-value function; time-varying channel; time-varying traffic arrival process; traffic admission control; Admission control; Approximation methods; Delays; Equations; Heuristic algorithms; Mathematical model; Throughput; Markov decision process (MDP); Scheduling; learning; structural results; traffic admission control;
Journal_Title :
Wireless Communications, IEEE Transactions on
DOI :
10.1109/TW.2013.081913.121525