Pareto Optimal Solutions for Network Defense Strategy Selection Simulator in Multi-Objective Reinforcement Learning

Using Pareto optimization in Multi-Objective Reinforcement Learning (MORL) leads to better learning results for network defense games. This is particularly useful for network security agents, who must often balance several goals when choosing what action to take in defense of a network. If the defen...

Full description

Bibliographic Details
Main Authors: Yang Sun, Yun Li, Wei Xiong, Zhonghua Yao, Krishna Moniz, Ahmed Zahir
Format: Article
Language:English
Published: MDPI AG 2018-01-01
Series:Applied Sciences
Subjects:
Online Access:http://www.mdpi.com/2076-3417/8/1/136