Learning Multirobot Hose Transportation and Deployment by Distributed Round-Robin Q-Learning.
Multi-Agent Reinforcement Learning (MARL) algorithms face two main difficulties: the curse of dimensionality, and environment non-stationarity due to the independent learning processes carried out by the agents concurrently. In this paper we formalize and prove the convergence of a Distributed Round...
Main Authors: | Borja Fernandez-Gauna, Ismael Etxeberria-Agiriano, Manuel Graña |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2015-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4497621?pdf=render |
Similar Items
-
Alternatives to round robin oral reading
by: Cree, Linda June
Published: (1986) -
The EWGRD Round Robin Measurement Exercise
by: Thornton D.A., et al.
Published: (2016-01-01) -
Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses
by: Jiajia Dou, et al.
Published: (2015-01-01) -
Round-robin scheduling for multiple group communication
by: Huang, Sheng-Nan, et al.
Published: (1991) -
High speed round robin queueing networks
by: Yates, Roy D
Published: (2005)