Title :
Batch reinforcement learning based dynamic optimization for polyethylene grade transitions
Author :
Yujie Wei ; Zhihua Xiong ; Yongheng Jiang ; Dexian Huang
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
Abstract :
It is necessary for polyethylene grade transitions to establish dynamic optimization in order to preserve competitiveness in the global polymer market. Although the typical technology of iterative learning control has been performed to track the reference trajectories, it is usually difficult to obtain the optimal reference trajectories. A transition from one specific grade to another specific grade can be considered as one batch, therefore, we propose a feasible scheme for dynamic optimization of polyethylene grade transitions based on the batch reinforcement learning, which can derive a best possible policy after a few batches. The scheme aims to integrate the offline reference optimization and online implementations, and explore a better control policy instead of tracking the predetermined trajectories. This designed scheme has three distinct phases, collecting observations from reference and the historical trajectories, learning a policy, and executions. The proposed method is verified by the simulated polyethylene grade transitions, and good performance has been obtained.
Keywords :
batch processing (industrial); dynamic programming; learning (artificial intelligence); polymers; production engineering computing; batch reinforcement learning; control policy; dynamic optimization; iterative learning control; polyethylene grade transitions; polymer market; reference trajectories; Approximation algorithms; Approximation methods; Decision trees; Optimization; Polyethylene; Trajectory; Dynamic optimization; batch reinforcement learning; observations; polyethylene grade transitions;
Conference_Titel :
Control Conference (CCC), 2014 33rd Chinese
Conference_Location :
Nanjing
DOI :
10.1109/ChiCC.2014.6896268