Upper Bounds on the Performance of Discretisation in Reinforcement Learning
Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value fun...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
South African Institute of Computer Scientists and Information Technologists
2015-12-01
|
Series: | South African Computer Journal |
Subjects: | |
Online Access: | http://sacj.cs.uct.ac.za/index.php/sacj/article/view/284 |
id |
doaj-d287087185fe41dea2f9462369799bec |
---|---|
record_format |
Article |
spelling |
doaj-d287087185fe41dea2f9462369799bec2020-11-25T02:42:09ZengSouth African Institute of Computer Scientists and Information TechnologistsSouth African Computer Journal1015-79992313-78352015-12-0105710.18489/sacj.v0i57.284145Upper Bounds on the Performance of Discretisation in Reinforcement LearningMichael Robin Mitchley0School of Computer Science and Applied Mathematics University of the Witwatersrand, JohannesburgReinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation.http://sacj.cs.uct.ac.za/index.php/sacj/article/view/284Reinforcement learningTile codingPerformance BoundsAverage Case Analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Michael Robin Mitchley |
spellingShingle |
Michael Robin Mitchley Upper Bounds on the Performance of Discretisation in Reinforcement Learning South African Computer Journal Reinforcement learning Tile coding Performance Bounds Average Case Analysis |
author_facet |
Michael Robin Mitchley |
author_sort |
Michael Robin Mitchley |
title |
Upper Bounds on the Performance of Discretisation in Reinforcement Learning |
title_short |
Upper Bounds on the Performance of Discretisation in Reinforcement Learning |
title_full |
Upper Bounds on the Performance of Discretisation in Reinforcement Learning |
title_fullStr |
Upper Bounds on the Performance of Discretisation in Reinforcement Learning |
title_full_unstemmed |
Upper Bounds on the Performance of Discretisation in Reinforcement Learning |
title_sort |
upper bounds on the performance of discretisation in reinforcement learning |
publisher |
South African Institute of Computer Scientists and Information Technologists |
series |
South African Computer Journal |
issn |
1015-7999 2313-7835 |
publishDate |
2015-12-01 |
description |
Reinforcement learning is a machine learning framework whereby an agent learns to perform a task by maximising its total reward received for selecting actions in each state. The policy mapping states to actions that the agent learns is either represented explicitly, or implicitly through a value function. It is common in reinforcement learning to discretise a continuous state space using tile coding or binary features. We prove an upper bound on the performance of discretisation for direct policy representation or value function approximation. |
topic |
Reinforcement learning Tile coding Performance Bounds Average Case Analysis |
url |
http://sacj.cs.uct.ac.za/index.php/sacj/article/view/284 |
work_keys_str_mv |
AT michaelrobinmitchley upperboundsontheperformanceofdiscretisationinreinforcementlearning |
_version_ |
1724775046676742144 |