Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism

In this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied...

Full description

Bibliographic Details
Main Authors: Zhikai Ma, Qian Huo, Tao Zhang, Jianjun Hao, Wei Wang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9312596/
id doaj-678ecf97af3a4e14a406cf45824c57e6
record_format Article
spelling doaj-678ecf97af3a4e14a406cf45824c57e62021-03-30T15:17:40ZengIEEEIEEE Access2169-35362021-01-0197280729210.1109/ACCESS.2020.30489669312596Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating MechanismZhikai Ma0https://orcid.org/0000-0002-5643-6589Qian Huo1Tao Zhang2https://orcid.org/0000-0001-6804-4147Jianjun Hao3Wei Wang4College of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaChina North Vehicle Research Institute, Beijing, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaIn this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied to model the EMS problem for the target HETV. Especially, a time-varying weighting factor is introduced here to update old network parameters with experience learned from most recent cycle segment. Afterwards, simulation is conducted to verify the effectiveness and adaptability of the proposed method. Results show that DDPG-based EMS with online updating mechanism can achieve nearly 90% fuel economy performance as that of dynamic programming while computational time is greatly reduced. Finally, hardware-in-loop experiment is carried out to evaluate the real-world performance of the proposed method.https://ieeexplore.ieee.org/document/9312596/Energy managementhybrid electric tracked vehicleonline updating mechanismdeep deterministic policy gradient
collection DOAJ
language English
format Article
sources DOAJ
author Zhikai Ma
Qian Huo
Tao Zhang
Jianjun Hao
Wei Wang
spellingShingle Zhikai Ma
Qian Huo
Tao Zhang
Jianjun Hao
Wei Wang
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
IEEE Access
Energy management
hybrid electric tracked vehicle
online updating mechanism
deep deterministic policy gradient
author_facet Zhikai Ma
Qian Huo
Tao Zhang
Jianjun Hao
Wei Wang
author_sort Zhikai Ma
title Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
title_short Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
title_full Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
title_fullStr Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
title_full_unstemmed Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
title_sort deep deterministic policy gradient based energy management strategy for hybrid electric tracked vehicle with online updating mechanism
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description In this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied to model the EMS problem for the target HETV. Especially, a time-varying weighting factor is introduced here to update old network parameters with experience learned from most recent cycle segment. Afterwards, simulation is conducted to verify the effectiveness and adaptability of the proposed method. Results show that DDPG-based EMS with online updating mechanism can achieve nearly 90% fuel economy performance as that of dynamic programming while computational time is greatly reduced. Finally, hardware-in-loop experiment is carried out to evaluate the real-world performance of the proposed method.
topic Energy management
hybrid electric tracked vehicle
online updating mechanism
deep deterministic policy gradient
url https://ieeexplore.ieee.org/document/9312596/
work_keys_str_mv AT zhikaima deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism
AT qianhuo deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism
AT taozhang deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism
AT jianjunhao deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism
AT weiwang deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism
_version_ 1724179695822438400