Learning Multirobot Hose Transportation and Deployment by Distributed Round-Robin Q-Learning.

Multi-Agent Reinforcement Learning (MARL) algorithms face two main difficulties: the curse of dimensionality, and environment non-stationarity due to the independent learning processes carried out by the agents concurrently. In this paper we formalize and prove the convergence of a Distributed Round...

Bibliographic Details
Main Authors:	Borja Fernandez-Gauna, Ismael Etxeberria-Agiriano, Manuel Graña
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2015-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC4497621?pdf=render

Similar Items