Counterfactual-Based Action Evaluation Algorithm in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) algorithms have made great achievements in various scenarios, but there are still many problems in solving sequential social dilemmas (SSDs). In SSDs, the agent’s actions not only change the instantaneous state of the environment but also affect the latent s...

Full description

Bibliographic Details
Main Authors:	Guo, T. (Author), Jiang, H. (Author), Yuan, Y. (Author), Zhao, P. (Author)
Format:	Article
Language:	English
Published:	MDPI 2022
Subjects:	actor-critic counterfactual reasoning intrinsic reward multi-agent reinforcement learning multi-agent system social dilemmas
Online Access:	View Fulltext in Publisher

Internet

View Fulltext in Publisher

Counterfactual-Based Action Evaluation Algorithm in Multi-Agent Reinforcement Learning

Internet

Similar Items