Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning

Many goal-reaching reinforcement learning (RL) tasks have empirically verified that rewarding the agent on subgoals improves convergence speed and practical performance. We attempt to provide a theoretical framework to quantify the computational benefits of rewarding the completion of subgoals, in t...

Full description

Bibliographic Details
Main Authors: Baek, C. (Author), Jiao, J. (Author), Ma, Y. (Author), Zhai, Y. (Author), Zhou, Z. (Author)
Format: Article
Language:English
Published: AI Access Foundation 2022
Subjects:
Online Access:View Fulltext in Publisher