Efficient Robot Skills Learning with Weighted Near-Optimal Experiences Policy Optimization

Autonomous learning of robotic skills seems to be more natural and more practical than engineered skills, analogous to the learning process of human individuals. Policy gradient methods are a type of reinforcement learning technique which have great potential in solving robot skills learning problem...

Full description

Bibliographic Details
Main Authors: Liwei Hou, Hengsheng Wang, Haoran Zou, Qun Wang
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/3/1131