Title :
An online learning approach to QoE-fair distributed rate allocation in multi-user video streaming
Author :
Hemmati, Mahdi ; Yassine, Abdulsalam ; Shirmohammadi, Shervin
Author_Institution :
Sch. of Electr. Eng. & Comput. Sci., Univ. of Ottawa, Ottawa, ON, Canada
Abstract :
A Decentralized Partially Observable Markov Decision Process (DEC-POMDP) model of the resource allocation problem in multi-user video streaming over the Internet is formulated and developed to achieve both efficiency and fairness in terms of the expected value of end-user´s Quality of Experience (QoE) over a finite horizon. A distributed multi-agent reinforcement learning algorithm is proposed as a low-complexity solution to this decision problem. It is shown that the proposed learning algorithm would converge to the optimal value function and optimal policies of DEC-POMDP.
Keywords :
Markov processes; learning (artificial intelligence); quality of experience; video streaming; DEC-POMDP; QoE fair distributed rate allocation; Quality of Experience; decentralized partially observable Markov decision process; distributed multiagent reinforcement learning algorithm; multiuser video streaming; online learning approach; optimal value function; resource allocation problem; Games; Heuristic algorithms; Multimedia communication; Packet loss; Resource management; Streaming media;
Conference_Titel :
Signal Processing and Communication Systems (ICSPCS), 2014 8th International Conference on
Conference_Location :
Gold Coast, QLD
DOI :
10.1109/ICSPCS.2014.7021057