Distributed Policy Evaluation with Fractional Order Dynamics in Multiagent Reinforcement Learning

The main objective of multiagent reinforcement learning is to achieve a globally optimal policy. Evaluating the value function is difficult in a high-dimensional state space. Therefore, we transform the problem of multiagent reinforcement learning into a distributed optimization problem with constr...

Bibliographic Details
Main Authors:	Wei Dai, Wei Wang, Zhongtian Mao, Ruwen Jiang, Fudong Nian, Teng Li
Format:	Article
Language:	English
Published:	Hindawi-Wiley 2021-01-01
Series:	Security and Communication Networks
Online Access:	http://dx.doi.org/10.1155/2021/1020466

Similar Items