DocumentCode
3605247
Title
Reinforcement Learning of Adaptive Energy Management With Transition Probability for a Hybrid Electric Tracked Vehicle
Author
Teng Liu ; Yuan Zou ; Dexing Liu ; Fengchun Sun
Author_Institution
Beijing Collaborative & Innovative Center for Electr. Vehicles, Beijing Inst. of Technol., Beijing, China
Volume
62
Issue
12
fYear
2015
Firstpage
7837
Lastpage
7846
Abstract
A reinforcement learning-based adaptive energy management (RLAEM) is proposed for a hybrid electric tracked vehicle (HETV) in this paper. A control oriented model of the HETV is first established, in which the state-of-charge (SOC) of battery and the speed of generator are the state variables, and the engine´s torque is the control variable. Subsequently, a transition probability matrix is learned from a specific driving schedule of the HETV. The proposed RLAEM decides appropriate power split between the battery and engine-generator set (EGS) to minimize the fuel consumption over different driving schedules. With the RLAEM, not only is driver´s power requirement guaranteed, but also the fuel economy is improved as well. Finally, the RLAEM is compared with the stochastic dynamic programming (SDP)-based energy management for different driving schedules. The simulation results demonstrate the adaptability, optimality, and learning ability of the RLAEM and its capacity of reducing the computation time.
Keywords
dynamic programming; energy management systems; fuel economy; hybrid electric vehicles; learning (artificial intelligence); matrix algebra; minimisation; probability; stochastic programming; torque control; EGS; HETV; RLAEM; SDP-based energy management; SoC; computation time reduction; control oriented model; driving schedule; engine torque; engine-generator set; fuel consumption minimisation; fuel economy; hybrid electric tracked vehicle; reinforcement learning-based adaptive energy management; state of charge; state variables; stochastic dynamic programming (; transition probability matrix; Batteries; Energy management; Fuels; Hybrid electric vehicles; Optimal control; Schedules; $Q$-learning algorithm; Adaptability; Q-learning algorithm; energy management; hybrid electric tracked vehicle; hybrid electric tracked vehicle (HETV); programming (SDP); state of charge (SOC); stochastic dynamic; stochastic dynamic programming (SDP);
fLanguage
English
Journal_Title
Industrial Electronics, IEEE Transactions on
Publisher
ieee
ISSN
0278-0046
Type
jour
DOI
10.1109/TIE.2015.2475419
Filename
7234919
Link To Document