Title :
Stochastic optimal generation command dispatch based on improved hierarchical reinforcement learning approach
Author :
yu, tao ; Wang, Y.M. ; Ye, W.J. ; Zhou, B. ; Chan, Ka Wing
Author_Institution :
Coll. of Electr. Power, South China Univ. of Technol., Guangzhou, China
fDate :
8/1/2011 12:00:00 AM
Abstract :
This study presents an improved hierarchical reinforcement learning (HRL) approach to deal with the curse of dimensionality in the dynamic optimisation of generation command dispatch (GCD) for automatic generation control (AGC) under control performance standards. The AGC committed units are firstly classified into several different groups according to their time delay of frequency control, and the core problem of GCD is decomposed into a set of subtasks for search of the optimal regulation participation factors with the solution algorithm. The time-varying coordination factor is introduced in the control layer to improve the learning efficiency of HRL, and the generating error, hydro capacity margin and AGC regulating costs are formulated into Markov decision process reward function. The application of the improved hierarchical Q-learning (HQL) algorithm in the China southern power grid model shows that the proposed method can reduce the convergence time in the pre-learning process, decrease the AGC regulating cost and improve the control performance of AGC systems compared with the conventional HQL, genetic algorithm and a engineering method.
Keywords :
Markov processes; delays; genetic algorithms; learning (artificial intelligence); optimal control; power generation control; power generation dispatch; power generation economics; power grids; stochastic processes; time-varying systems; AGC regulating costs; AGC system performance; GCD; HQL; HRL; Markov decision process; automatic generation control layer; control performance standard; frequency control; genetic algorithm; hierarchical Q-learning algorithm; hydrocapacity margin; improved hierarchical reinforcement learning efficiency; optimal regulation participation factor; power grid model; prelearning process; stochastic optimal generation command dispatch; time delay; time varying coordination factor;
Journal_Title :
Generation, Transmission & Distribution, IET
DOI :
10.1049/iet-gtd.2010.0600