Policy Return: A New Method for Reducing the Number of Experimental Trials in Deep Reinforcement Learning
Using the same algorithm and hyperparameter configurations, deep reinforcement learning (DRL) will derive drastically different results from multiple experimental trials, and most of these results are unsatisfactory. Because of the instability of the results, researchers have to perform many trials...
Main Authors: | Feng Liu, Shuling Dai, Yongjia Zhao |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9298771/ |
Similar Items
-
IBUPROFEN IN COMPLEX TREATMENT OF FEVER IN CHILDREN
by: А.А. Alexeeva, et al.
Published: (2011-11-01) -
IBUPROFEN IN COMPLEX TREATMENT OF FEVER IN CHILDREN
by: А.А. Alexeeva, et al.
Published: (2011-01-01) -
Poor Oral Health and Diet in Relation to Weight Loss, Stable Underweight, and Obesity in Community-Dwelling Older Adults: A Cross-Sectional Study From the JAGES 2010 Project
by: Mieko Nakamura
Published: (2016-06-01) -
Small Business Sustainability for Longer Than 5 Years
by: Godwin, Matthew James
Published: (2019) -
Money and Pay-As-You-Go Pension
by: Masaya Yasuoka
Published: (2018-03-01)