A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System

In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to a...

Full description

Bibliographic Details
Main Author:	Chunyang Hu
Format:	Article
Language:	English
Published:	MDPI AG 2020-04-01
Series:	Symmetry
Subjects:	deep deterministic policy gradient policy-based RL knowledge transfer momentum mechanism multi-agent systems
Online Access:	https://www.mdpi.com/2073-8994/12/4/631

id	doaj-0b08639e6f584fbebd7f8aaab60e5b78
record_format	Article
spelling	doaj-0b08639e6f584fbebd7f8aaab60e5b782020-11-25T02:55:17ZengMDPI AGSymmetry2073-89942020-04-011263163110.3390/sym12040631A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent SystemChunyang Hu0School of Computer Engineering, Hubei University of Arts and Science, Xiangyang 441053, ChinaIn this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making of multi-agent. In the process of training, the information of other agents is introduced to the critic network to improve the strategy of confrontation. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time action and Q-value function respectively and use a momentum mechanism to optimize the training process to accelerate the convergence rate for the neural network. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to achieve the assistant decision-making for the game agent. In addition, an effective reward function is used to help agents balance losses of enemies and our side. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent can successfully learn to fight with the competitors and achieve a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method can gradually improve the decision-making level of the learning agent.https://www.mdpi.com/2073-8994/12/4/631deep deterministic policy gradientpolicy-based RLknowledge transfermomentum mechanismmulti-agent systems
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Chunyang Hu
spellingShingle	Chunyang Hu A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System Symmetry deep deterministic policy gradient policy-based RL knowledge transfer momentum mechanism multi-agent systems
author_facet	Chunyang Hu
author_sort	Chunyang Hu
title	A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
title_short	A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
title_full	A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
title_fullStr	A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
title_full_unstemmed	A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
title_sort	confrontation decision-making method with deep reinforcement learning and knowledge transfer for multi-agent system
publisher	MDPI AG
series	Symmetry
issn	2073-8994
publishDate	2020-04-01
description	In this paper, deep reinforcement learning (DRL) and knowledge transfer are used to achieve the effective control of the learning agent for the confrontation in the multi-agent systems. Firstly, a multi-agent Deep Deterministic Policy Gradient (DDPG) algorithm with parameter sharing is proposed to achieve confrontation decision-making of multi-agent. In the process of training, the information of other agents is introduced to the critic network to improve the strategy of confrontation. The parameter sharing mechanism can reduce the loss of experience storage. In the DDPG algorithm, we use four neural networks to generate real-time action and Q-value function respectively and use a momentum mechanism to optimize the training process to accelerate the convergence rate for the neural network. Secondly, this paper introduces an auxiliary controller using a policy-based reinforcement learning (RL) method to achieve the assistant decision-making for the game agent. In addition, an effective reward function is used to help agents balance losses of enemies and our side. Furthermore, this paper also uses the knowledge transfer method to extend the learning model to more complex scenes and improve the generalization of the proposed confrontation model. Two confrontation decision-making experiments are designed to verify the effectiveness of the proposed method. In a small-scale task scenario, the trained agent can successfully learn to fight with the competitors and achieve a good winning rate. For large-scale confrontation scenarios, the knowledge transfer method can gradually improve the decision-making level of the learning agent.
topic	deep deterministic policy gradient policy-based RL knowledge transfer momentum mechanism multi-agent systems
url	https://www.mdpi.com/2073-8994/12/4/631
work_keys_str_mv	AT chunyanghu aconfrontationdecisionmakingmethodwithdeepreinforcementlearningandknowledgetransferformultiagentsystem AT chunyanghu confrontationdecisionmakingmethodwithdeepreinforcementlearningandknowledgetransferformultiagentsystem
_version_	1724716838452985856

A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System

Similar Items