Title :
Optimal self-learning battery control in smart residential grids by iterative Q-learning algorithm
Author :
Qinglai Wei ; Derong Liu ; Guang Shi ; Yu Liu ; Qiang Guan
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Abstract :
In this paper, a novel dual iterative Q-learning algorithm is developed to solve the optimal battery management and control problems in smart residential environments. The main idea is to use adaptive dynamic programming (ADP) technique to obtain the optimal battery management and control scheme iteratively for residential energy systems. In the developed dual iterative Q-learning algorithm, two iterations, including external and internal iterations, are introduced, where internal iteration minimizes the total cost of power loads in each period and the external iteration makes the iterative Q function converge to the optimum. For the first time, the convergence property of iterative Q-learning method is proven to guarantee the convergence property of the iterative Q function. Finally, numerical results are given to illustrate the performance of the developed algorithm.
Keywords :
battery management systems; convergence of numerical methods; dynamic programming; iterative learning control; optimal control; smart power grids; unsupervised learning; adaptive dynamic programming technique; convergence property; dual iterative Q-learning algorithm; external iterations; internal iterations; iterative Q function; optimal battery control problems; optimal battery management; power loads; residential energy systems; smart residential environments; total cost minimization;
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/ADPRL.2014.7010630