Title :
A Q-Learning solution for adaptive video streaming
Author :
Marinca, Dana ; Barth, Dominique ; De Vleeschauwer, Danny
Author_Institution :
PRiSM, Univ. of Versailles, Versailles, France
Abstract :
Adaptive streaming is a promising technique for video streaming service to cope with the quality degradation of mobile users´ connections. In this paper we propose a service layer control mechanism for video flows based on a Reinforcement Learning (RL) paradigm that will gracefully degrade the video quality experienced by the end-user depending on the connection status. Using layered coded videos, the end-user should find the most appropriate quality level for its stream. The adaptive streaming problem can naturally be modeled as a Partial Observable Markov Decision Process (POMDP) because the end-user has partial information about the network state based on the received throughput, but this model cannot be applied on-line during streaming. We propose here an MDP modeling the adaptive streaming problem that can be solved on-line by the Q-Learning algorithm. Both models have identical solutions proving the validity of the proposed MDP model.
Keywords :
Markov processes; learning (artificial intelligence); mobile radio; telecommunication computing; video streaming; POMDP; Q-learning solution; adaptive video streaming; layered coded videos; mobile users connections; partial observable Markov decision process; quality degradation; reinforcement learning paradigm; service layer control mechanism; Conferences; Decision support systems; Mobile communication; Mobile computing; Wireless communication; MDP; POMDP; Q-Learning; adaptive video streaming; layered coded video; reinforcement learning; wireless users;
Conference_Titel :
Mobile and Wireless Networking (MoWNeT), 2013 International Conference on Selected Topics in
Conference_Location :
Montre??al, QC
DOI :
10.1109/MoWNet.2013.6613807