Distributed Policy Evaluation with Fractional Order Dynamics in Multiagent Reinforcement Learning

The main objective of multiagent reinforcement learning is to achieve a global optimal policy. It is difficult to evaluate the value function with high-dimensional state space. Therefore, we transfer the problem of multiagent reinforcement learning into a distributed optimization problem with constr...

Full description

Bibliographic Details
Main Authors: Wei Dai, Wei Wang, Zhongtian Mao, Ruwen Jiang, Fudong Nian, Teng Li
Format: Article
Language:English
Published: Hindawi-Wiley 2021-01-01
Series:Security and Communication Networks
Online Access:http://dx.doi.org/10.1155/2021/1020466