• DocumentCode
    241270
  • Title

    An online learning approach to QoE-fair distributed rate allocation in multi-user video streaming

  • Author

    Hemmati, Mahdi ; Yassine, Abdulsalam ; Shirmohammadi, Shervin

  • Author_Institution
    Sch. of Electr. Eng. & Comput. Sci., Univ. of Ottawa, Ottawa, ON, Canada
  • fYear
    2014
  • fDate
    15-17 Dec. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A Decentralized Partially Observable Markov Decision Process (DEC-POMDP) model of the resource allocation problem in multi-user video streaming over the Internet is formulated and developed to achieve both efficiency and fairness in terms of the expected value of end-user´s Quality of Experience (QoE) over a finite horizon. A distributed multi-agent reinforcement learning algorithm is proposed as a low-complexity solution to this decision problem. It is shown that the proposed learning algorithm would converge to the optimal value function and optimal policies of DEC-POMDP.
  • Keywords
    Markov processes; learning (artificial intelligence); quality of experience; video streaming; DEC-POMDP; QoE fair distributed rate allocation; Quality of Experience; decentralized partially observable Markov decision process; distributed multiagent reinforcement learning algorithm; multiuser video streaming; online learning approach; optimal value function; resource allocation problem; Games; Heuristic algorithms; Multimedia communication; Packet loss; Resource management; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communication Systems (ICSPCS), 2014 8th International Conference on
  • Conference_Location
    Gold Coast, QLD
  • Type

    conf

  • DOI
    10.1109/ICSPCS.2014.7021057
  • Filename
    7021057