مرکز منطقه ای اطلاع رساني علوم و فناوري - Low-complexity reinforcement learning for delay-sensitive compression in networked video stream mining

DocumentCode :

3153737

Title :

Low-complexity reinforcement learning for delay-sensitive compression in networked video stream mining

Author :

Xiaoqing Zhu ; Lany, Cuiling ; van der Schaarz, Mihaela

Author_Institution :

Adv. Archit. & Res., Cisco Syst. Inc., San Jose, CA, USA

fYear :

2013

fDate :

15-19 July 2013

Firstpage :

Lastpage :

Abstract :

In networked video stream mining systems, real-time video contents are captured remotely and, subsequently, encoded and transmitted over bandwidth-constrained networks for classification at the receiver. One key task at the encoder is to adapt its compression on the fly based on time-varying network bandwidth and video characteristics - while attaining low delay and high classification accuracy. In this paper, we formalize the decision at the encoder side as an infinite horizon Markov Decision Process (MDP). We employ low-complexity, model-free reinforcement learning schemes to solve this problem efficiently under dynamic and unknown environment. Our proposed scheme adopts the technique of virtual experience (VE) update to drastically speed up convergence over conventional Q-learning, allowing the encoder to react to abrupt network changes on the order of minutes, instead of hours. In comparison to myopic optimization, it consistently achieves higher overall reward and lower sending delay under various network conditions.

Keywords :

Markov processes; data compression; decision theory; image classification; learning (artificial intelligence); video coding; video streaming; MDP; bandwidth-constrained network; classification accuracy; delay-sensitive compression; encoder side; infinite horizon Markov decision process; low-complexity reinforcement learning; model-free reinforcement learning scheme; networked video stream mining; real-time video content; time-varying network bandwidth; video characteristics; virtual experience; Accuracy; Bandwidth; Delays; Encoding; Feature extraction; Optimization; Streaming media; Markov decision process (MDP); human action recognition; networked video stream mining; reinforcement learning Q-learning with virtual experience (VE) update;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia and Expo (ICME), 2013 IEEE International Conference on

Conference_Location :

San Jose, CA

ISSN :

1945-7871

Type :

conf

DOI :

10.1109/ICME.2013.6607600

Filename :

6607600

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3153737