Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning

Many goal-reaching reinforcement learning (RL) tasks have empirically verified that rewarding the agent on subgoals improves convergence speed and practical performance. We attempt to provide a theoretical framework to quantify the computational benefits of rewarding the completion of subgoals, in t...

Full description

Bibliographic Details
Main Authors:	Baek, C. (Author), Jiao, J. (Author), Ma, Y. (Author), Zhai, Y. (Author), Zhou, Z. (Author)
Format:	Article
Language:	English
Published:	AI Access Foundation 2022
Subjects:	Computational complexity Convergence speed Economic and social effects Graph theory Intermediate state Multipath Performance Policy learning Reinforcement learning Short-path Single path Subgoals Theoretical framework Value iteration
Online Access:	View Fulltext in Publisher

Internet

View Fulltext in Publisher

Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning

Internet

Similar Items