Upper Bounds on the Performance of Discretisation in Reinforcement Learning
Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value fun...
Main Author: | Michael Robin Mitchley |
---|---|
Format: | Article |
Language: | English |
Published: |
South African Institute of Computer Scientists and Information Technologists
2015-12-01
|
Series: | South African Computer Journal |
Subjects: | |
Online Access: | http://sacj.cs.uct.ac.za/index.php/sacj/article/view/284 |
Similar Items
-
Reinforcement Learning in Keepaway Framework for RoboCup Simulation League
by: Li, Wei
Published: (2011) -
Autonomous Mobility Management for 5G Ultra-Dense HetNets via Reinforcement Learning With Tile Coding Function Approximation
by: Qianyu Liu, et al.
Published: (2021-01-01) -
Multi-Agent Deep Reinforcement Learning-Based Cooperative Spectrum Sensing With Upper Confidence Bound Exploration
by: Yu Zhang, et al.
Published: (2019-01-01) -
Hierarchical Reinforcement Learning with Function Approximation for Adaptive Control
by: Skelly, Margaret Mary
Published: (2004) -
Adaptive value function approximation in reinforcement learning using wavelets
by: Mitchley, Michael
Published: (2016)