Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
In this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9312596/ |
id |
doaj-678ecf97af3a4e14a406cf45824c57e6 |
---|---|
record_format |
Article |
spelling |
doaj-678ecf97af3a4e14a406cf45824c57e62021-03-30T15:17:40ZengIEEEIEEE Access2169-35362021-01-0197280729210.1109/ACCESS.2020.30489669312596Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating MechanismZhikai Ma0https://orcid.org/0000-0002-5643-6589Qian Huo1Tao Zhang2https://orcid.org/0000-0001-6804-4147Jianjun Hao3Wei Wang4College of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaChina North Vehicle Research Institute, Beijing, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaCollege of Mechanical and Electrical Engineering, Hebei Agricultural University, Baoding, ChinaIn this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied to model the EMS problem for the target HETV. Especially, a time-varying weighting factor is introduced here to update old network parameters with experience learned from most recent cycle segment. Afterwards, simulation is conducted to verify the effectiveness and adaptability of the proposed method. Results show that DDPG-based EMS with online updating mechanism can achieve nearly 90% fuel economy performance as that of dynamic programming while computational time is greatly reduced. Finally, hardware-in-loop experiment is carried out to evaluate the real-world performance of the proposed method.https://ieeexplore.ieee.org/document/9312596/Energy managementhybrid electric tracked vehicleonline updating mechanismdeep deterministic policy gradient |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhikai Ma Qian Huo Tao Zhang Jianjun Hao Wei Wang |
spellingShingle |
Zhikai Ma Qian Huo Tao Zhang Jianjun Hao Wei Wang Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism IEEE Access Energy management hybrid electric tracked vehicle online updating mechanism deep deterministic policy gradient |
author_facet |
Zhikai Ma Qian Huo Tao Zhang Jianjun Hao Wei Wang |
author_sort |
Zhikai Ma |
title |
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism |
title_short |
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism |
title_full |
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism |
title_fullStr |
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism |
title_full_unstemmed |
Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism |
title_sort |
deep deterministic policy gradient based energy management strategy for hybrid electric tracked vehicle with online updating mechanism |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
In this paper, an online energy management strategy (EMS) for hybrid electric tracked vehicle (HETV) is developed based on deep deterministic policy gradient (DDPG) with time-varying weighting factor to further improve economic performance of HETV and reduce computational burden. The DDPG is applied to model the EMS problem for the target HETV. Especially, a time-varying weighting factor is introduced here to update old network parameters with experience learned from most recent cycle segment. Afterwards, simulation is conducted to verify the effectiveness and adaptability of the proposed method. Results show that DDPG-based EMS with online updating mechanism can achieve nearly 90% fuel economy performance as that of dynamic programming while computational time is greatly reduced. Finally, hardware-in-loop experiment is carried out to evaluate the real-world performance of the proposed method. |
topic |
Energy management hybrid electric tracked vehicle online updating mechanism deep deterministic policy gradient |
url |
https://ieeexplore.ieee.org/document/9312596/ |
work_keys_str_mv |
AT zhikaima deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism AT qianhuo deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism AT taozhang deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism AT jianjunhao deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism AT weiwang deepdeterministicpolicygradientbasedenergymanagementstrategyforhybridelectrictrackedvehiclewithonlineupdatingmechanism |
_version_ |
1724179695822438400 |