Title :
Reinforcement learning cooperative congestion control for multimedia networks
Author :
Kao-Shing Hwang ; Cheng-Shong Wu ; Hui-Kai Su
Author_Institution :
Dept. of Electr. Eng., Nat. Chung Cheng Univ., Chia-Yi, Taiwan
Date :
27 June-3 July 2005
Abstract :
A cooperative congestion control scheme based on reinforcement learning is presented for multimedia networks. The proposed controller, which performs rate-based predictive control, consists of two subsystems: a long-term policy critic and a short-term rate-adaptor. The controllers in a chained network jointly learn the control policy through real-time interaction, without prior knowledge of a network model. In addition, a cooperative fuzzy reward evaluator, based on game theory, provides cooperative reinforcement signals that train the controllers to adapt to a dynamic network environment. Once trained, the controllers adaptively regulate source flow to simultaneously meet the requirements of high link utilization, low packet loss rate (PLR), and low end-to-end delay. Simulation results show that the proposed approach is effective in controlling congestion of multimedia traffic on the Internet.
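The critic/rate-adaptor split and the cooperative reward described in the abstract can be illustrated with a minimal sketch. It is not the authors' controller: the abstract gives no equations, so everything here is an assumption made for illustration. The tabular TD(0) critic, the sign-flipping rate rule, the crisp (non-fuzzy) reward formula, the neighbour-averaging weight, and all names (`local_reward`, `cooperative_reward`, `Controller`) are hypothetical stand-ins.

```python
from dataclasses import dataclass, field


def local_reward(utilization, loss_rate, delay, max_delay=1.0):
    """Assumed crisp reward in [-1, 1]: utilization is good, loss and delay are bad.

    The paper uses a fuzzy evaluator; this linear score is only a stand-in.
    """
    delay_term = min(delay / max_delay, 1.0)
    score = utilization - loss_rate - delay_term
    return max(-1.0, min(1.0, score))


def cooperative_reward(own, neighbours, weight=0.5):
    """Blend a controller's own reward with the mean reward of its neighbours.

    The mixing rule and the weight are assumptions, not taken from the paper.
    """
    if not neighbours:
        return own
    return (1.0 - weight) * own + weight * sum(neighbours) / len(neighbours)


@dataclass
class Controller:
    n_states: int = 10      # number of discretized queue-occupancy levels
    alpha: float = 0.1      # critic learning rate
    gamma: float = 0.9      # discount factor
    beta: float = 0.05      # rate-adaptor step size
    rate: float = 0.5       # normalized source rate in [0, 1]
    direction: int = 1      # +1 raises the rate, -1 lowers it
    values: list = field(default_factory=list)

    def __post_init__(self):
        if not self.values:
            self.values = [0.0] * self.n_states

    def state(self, queue_fill):
        """Map a queue occupancy in [0, 1] to a discrete state index."""
        return min(int(queue_fill * self.n_states), self.n_states - 1)

    def step(self, queue_before, queue_after, reward):
        """One learning step: TD(0) critic update, then a rate adjustment."""
        s, s_next = self.state(queue_before), self.state(queue_after)
        td_error = reward + self.gamma * self.values[s_next] - self.values[s]
        self.values[s] += self.alpha * td_error  # long-term critic
        if td_error < 0:
            # Outcome was worse than predicted: reverse the rate direction.
            self.direction = -self.direction
        self.rate = max(0.0, min(1.0, self.rate + self.beta * self.direction))
        return td_error


if __name__ == "__main__":
    r = cooperative_reward(local_reward(0.9, 0.0, 0.1), [0.0, -0.4])
    c = Controller()
    print(c.step(0.25, 0.35, r), c.rate)
```

In this sketch the critic accumulates a long-term value estimate per queue state, while the rate rule reacts at every step to the sign of the TD error. Averaging neighbour rewards is one simple way to make a controller in a chain account for congestion it causes downstream.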
Keywords :
Internet; cooperative systems; game theory; learning (artificial intelligence); multimedia communication; predictive control; telecommunication congestion control; telecommunication traffic; Internet networks; cooperative fuzzy reward evaluator; cooperative reinforcement signals; end-to-end delay; link utilization; long-term policy critic; multimedia networks; multimedia traffic; packet loss rate; rate-based predictive control; reinforcement learning cooperative congestion control; short-term rate-adaptor; Asynchronous transfer mode; Communication system control; Communication system traffic control; Feedback control; Game theory; Learning; Predictive control; Propagation delay; Spine; Traffic control
Conference_Title :
Information Acquisition, 2005 IEEE International Conference on
Print_ISBN :
0-7803-9303-1
DOI :
10.1109/ICIA.2005.1635085