DocumentCode :
3153737
Title :
Low-complexity reinforcement learning for delay-sensitive compression in networked video stream mining
Author :
Xiaoqing Zhu ; Lany, Cuiling ; van der Schaarz, Mihaela
Author_Institution :
Adv. Archit. & Res., Cisco Syst. Inc., San Jose, CA, USA
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
In networked video stream mining systems, real-time video contents are captured remotely and, subsequently, encoded and transmitted over bandwidth-constrained networks for classification at the receiver. One key task at the encoder is to adapt its compression on the fly based on time-varying network bandwidth and video characteristics - while attaining low delay and high classification accuracy. In this paper, we formalize the decision at the encoder side as an infinite horizon Markov Decision Process (MDP). We employ low-complexity, model-free reinforcement learning schemes to solve this problem efficiently under dynamic and unknown environment. Our proposed scheme adopts the technique of virtual experience (VE) update to drastically speed up convergence over conventional Q-learning, allowing the encoder to react to abrupt network changes on the order of minutes, instead of hours. In comparison to myopic optimization, it consistently achieves higher overall reward and lower sending delay under various network conditions.
Keywords :
Markov processes; data compression; decision theory; image classification; learning (artificial intelligence); video coding; video streaming; MDP; bandwidth-constrained network; classification accuracy; delay-sensitive compression; encoder side; infinite horizon Markov decision process; low-complexity reinforcement learning; model-free reinforcement learning scheme; networked video stream mining; real-time video content; time-varying network bandwidth; video characteristics; virtual experience; Accuracy; Bandwidth; Delays; Encoding; Feature extraction; Optimization; Streaming media; Markov decision process (MDP); human action recognition; networked video stream mining; reinforcement learning Q-learning with virtual experience (VE) update;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
ISSN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2013.6607600
Filename :
6607600
Link To Document :
بازگشت