• DocumentCode
    2248771
  • Title

    Reinforcement learning congestion controller for multimedia surveillance system

  • Author

    Hsiao, Ming-Chang ; Hwang, Kao-Shing ; Tan, Shun-Wen ; Wu, Cheng-Shong

  • Author_Institution
    Dept. of Electr. Eng., Nat. Chung Cheng Univ., Ming-Hsiung, Taiwan
  • Volume
    3
  • fYear
    2003
  • fDate
    14-19 Sept. 2003
  • Firstpage
    4403
  • Abstract
    The use of reinforcement learning scheme for congestion control in factory surveillance network is presented in this paper. Traditional methods perform congestion control by means of monitoring the queue length. When the queue length is greater than a predefined threshold, the source rate is decreased at a fixed rate. However, the determination of the congested threshold and sending rate is difficult for these methods. We adopted a simple reinforcement learning method, called Adaptive Heuristic Critic (AHC), to solve the problem. The AHC controller maintains an expectation of reward and takes the best policy to control source flow. By way of learning and then taking right actions, simulation results have shown that the approach can promote the system utilization and decrease packet loss.
  • Keywords
    adaptive control; factory automation; learning (artificial intelligence); learning systems; multimedia communication; surveillance; telecommunication congestion control; adaptive heuristic critic controller; congestion controller; factory automation; factory surveillance network; multimedia surveillance system; packet loss; queue length monitoring; reinforcement learning method; sending rate; Automatic control; Communication system traffic control; Control systems; High-speed networks; Learning; Multimedia systems; Multiplexing; Neural networks; Surveillance; Traffic control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 2003. Proceedings. ICRA '03. IEEE International Conference on
  • ISSN
    1050-4729
  • Print_ISBN
    0-7803-7736-2
  • Type

    conf

  • DOI
    10.1109/ROBOT.2003.1242282
  • Filename
    1242282