Deep Deterministic Policy Gradient Based on Double Network Prioritized Experience Replay

The traditional deep deterministic policy gradient (DDPG) algorithm suffers from slow convergence and a tendency to fall into local optima. To address these two problems, a DDPG algorithm based on the double network prioritized experience replay mechanism (DNPER-DDPG) is propose...

Bibliographic Details
Main Authors:	Chaohai Kang, Chuiting Rong, Weijian Ren, Fengcai Huo, Pengyun Liu
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Continuous action space; deep deterministic policy gradient; experience replay mechanism; function approximation error; priority division
Online Access:	https://ieeexplore.ieee.org/document/9409070/

Similar Items

Motion Planning of Robot Manipulators for a Smoother Path Using a Twin Delayed Deep Deterministic Policy Gradient with Hindsight Experience Replay
by: MyeongSeop Kim, et al.
Published: (2020-01-01)

THE FUTURE, THE CRISIS, AND THE FUTURE OF REPLAY STORY
by: Eleonora Teresa Imbierowicz
Published: (2021-06-01)

Self-Adaptive Priority Correction for Prioritized Experience Replay
by: Hongjie Zhang, et al.
Published: (2020-10-01)

Trajectory Based Prioritized Double Experience Buffer for Sample-Efficient Policy Optimization
by: Shengxiang Li, et al.
Published: (2021-01-01)

On Event Reproduction Ratio in Stateless and Stateful Replay of Real-World Traffic
by: Ying-Dar Lin, et al.
Published: (2013-09-01)

Deep Deterministic Policy Gradient With Prioritized Sampling for Power Control
by: Shiyang Zhou, et al.
Published: (2020-01-01)

Hierarchical Intermittent Motor Control With Deterministic Policy Gradient
by: Haibo Shi, et al.
Published: (2019-01-01)

Enhanced Off-Policy Reinforcement Learning With Focused Experience Replay
by: Seung-Hyun Kong, et al.
Published: (2021-01-01)

Replay Debugger for Human Interactive Multiple Threaded Android Applications
Published: (2012)

Replay as wavefronts and theta sequences as bump oscillations in a grid cell attractor network
by: Louis Kang, et al.
Published: (2019-11-01)

Replay Debugger For Multi Threaded Android Applications
Published: (2011)

VANET Routing Replay Attack Detection Research Based on SVM
by: Fan Qing Gang, et al.
Published: (2016-01-01)

DESIGN AND IMPLEMENTATION OF RICH COMMUNICATION SERVICE SCENARIO REPLAYER AND PERFORMANCE EVALUATION OF APPLICATION SERVICE
by: Yellakonda, Amulya
Published: (2015)

The Story of the Nearest Relative: Shifts in Footing in Dramaturgical Replayings
by: Lisa Morriss, et al.
Published: (2019-06-01)

Real-Time Optimal Power Flow Using Twin Delayed Deep Deterministic Policy Gradient Algorithm
by: Jong Ha Woo, et al.
Published: (2020-01-01)

Audio recordings dataset of genuine and replayed speech at both ends of a telecommunication channel
by: Wei Shang, et al.
Published: (2021-02-01)

O replay na teletransmissão esportiva a partir do tempo morto do futebol
by: Marcio Telles
Published: (2014-07-01)

A Hippocampal Model for Behavioral Time Acquisition and Fast Bidirectional Replay of Spatio-Temporal Memory Sequences
by: Marcelo Matheus Gauy, et al.
Published: (2018-12-01)

Temporally delayed linear modelling (TDLM) measures replay in both animals and humans
by: Yunzhe Liu, et al.
Published: (2021-06-01)

Replay of Learned Neural Firing Sequences during Rest in Human Motor Cortex
by: Jean-Baptiste Eichenlaub, et al.
Published: (2020-05-01)

Projective Replay Analysis: A Reflective Approach for Aligning Educational Games to Their Goals
by: Harpstead, Erik
Published: (2017)

AUDIO-REPLAY ATTACKS SPOOFING DETECTION FOR SPEAKER RECOGNITION SYSTEMS
by: G. M. Lavrentyeva, et al.
Published: (2018-05-01)

Offline replay supports planning in human reinforcement learning
by: Ida Momennejad, et al.
Published: (2018-12-01)

Hippocampal replay of experience at real-world speeds
by: Eric L Denovellis, et al.
Published: (2021-09-01)

The roles of online and offline replay in planning
by: Eran Eldar, et al.
Published: (2020-06-01)

Synthetic Birdsongs as a Tool to Induce, and Listen to, Replay Activity in Sleeping Birds
by: Ana Amador, et al.
Published: (2021-07-01)

Replay Speech Detection Based on Dual-Input Hierarchical Fusion Network
by: Hu, C., et al.
Published: (2023)

Agent-Based Energy Sharing Mechanism Using Deep Deterministic Policy Gradient Algorithm
by: Yi Kuang, et al.
Published: (2020-09-01)

Replays of spatial memories suppress topological fluctuations in cognitive map
by: Andrey Babichev, et al.
Published: (2019-07-01)

An Interactive Traffic Replay Method in a Scaled-Down Environment
by: Hongri Liu, et al.
Published: (2019-01-01)

Platform-independent reverse debugging of the virtual machines
by: Pavel Dovgalyuk, et al.
Published: (2016-04-01)

Residential Demand Response Strategy Based on Deep Deterministic Policy Gradient
by: Chunyu Deng, et al.
Published: (2021-04-01)

On gradient flow and entropy solutions for nonlocal transport equations with nonlinear mobility
by: Fagioli, S., et al.
Published: (2022)

Episodic-like memory trace in awake replay of hippocampal place cell activity sequences
by: Susumu Takahashi
Published: (2015-10-01)

Deep Deterministic Policy Gradient Based Energy Management Strategy for Hybrid Electric Tracked Vehicle With Online Updating Mechanism
by: Zhikai Ma, et al.
Published: (2021-01-01)

Spatial Replay Protection for Proximity Services : Security and privacy aspects
by: Lindblom, Fredrik
Published: (2016)

Fast-backward replay of sequentially memorized items in humans
by: Qiaoli Huang, et al.
Published: (2018-10-01)

Experience Replay Using Transition Sequences
by: Thommen George Karimpanal, et al.
Published: (2018-06-01)

A New Replay Attack Against Automatic Speaker Verification Systems
by: Sung-Hyun Yoon, et al.
Published: (2020-01-01)

Control Method for PEMFC Using Improved Deep Deterministic Policy Gradient Algorithm
by: Jiawen Li, et al.
Published: (2021-09-01)
