• DocumentCode
    3605247
  • Title

    Reinforcement Learning of Adaptive Energy Management With Transition Probability for a Hybrid Electric Tracked Vehicle

  • Author

    Teng Liu ; Yuan Zou ; Dexing Liu ; Fengchun Sun

  • Author_Institution
    Beijing Collaborative & Innovative Center for Electr. Vehicles, Beijing Inst. of Technol., Beijing, China
  • Volume
    62
  • Issue
    12
  • fYear
    2015
  • Firstpage
    7837
  • Lastpage
    7846
  • Abstract
    A reinforcement learning-based adaptive energy management (RLAEM) is proposed for a hybrid electric tracked vehicle (HETV) in this paper. A control oriented model of the HETV is first established, in which the state-of-charge (SOC) of battery and the speed of generator are the state variables, and the engine´s torque is the control variable. Subsequently, a transition probability matrix is learned from a specific driving schedule of the HETV. The proposed RLAEM decides appropriate power split between the battery and engine-generator set (EGS) to minimize the fuel consumption over different driving schedules. With the RLAEM, not only is driver´s power requirement guaranteed, but also the fuel economy is improved as well. Finally, the RLAEM is compared with the stochastic dynamic programming (SDP)-based energy management for different driving schedules. The simulation results demonstrate the adaptability, optimality, and learning ability of the RLAEM and its capacity of reducing the computation time.
  • Keywords
    dynamic programming; energy management systems; fuel economy; hybrid electric vehicles; learning (artificial intelligence); matrix algebra; minimisation; probability; stochastic programming; torque control; EGS; HETV; RLAEM; SDP-based energy management; SoC; computation time reduction; control oriented model; driving schedule; engine torque; engine-generator set; fuel consumption minimisation; fuel economy; hybrid electric tracked vehicle; reinforcement learning-based adaptive energy management; state of charge; state variables; stochastic dynamic programming (; transition probability matrix; Batteries; Energy management; Fuels; Hybrid electric vehicles; Optimal control; Schedules; $Q$-learning algorithm; Adaptability; Q-learning algorithm; energy management; hybrid electric tracked vehicle; hybrid electric tracked vehicle (HETV); programming (SDP); state of charge (SOC); stochastic dynamic; stochastic dynamic programming (SDP);
  • fLanguage
    English
  • Journal_Title
    Industrial Electronics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0278-0046
  • Type

    jour

  • DOI
    10.1109/TIE.2015.2475419
  • Filename
    7234919