Minimal Exploration in Episodic Reinforcement Learning
Exploration-exploitation trade-off is a fundamental dilemma that reinforcement learning algorithms face. This dilemma is also central to the design of various state of the art bandit algorithms. We take inspiration from these algorithms and try to design reinforcement learning algorithms in an episo...
Main Author: | Tripathi, Ardhendu Shekhar |
---|---|
Format: | Others |
Language: | English |
Published: |
KTH, Skolan för elektroteknik och datavetenskap (EECS)
2018
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-233579 |
Similar Items
-
Minimizing Regret in Combinatorial Bandits and Reinforcement Learning
by: Talebi Mazraeh Shahi, Mohammad Sadegh
Published: (2017) -
Organisational ambidexterity in manufacturing SMEs : An empirical study of managers’ and workers’ perceptions of ambidextrous elements
by: Eriksson, Gusten, et al.
Published: (2019) -
Exploration and Exploitation in Reinforcement Learning
by: Adelsbo, Felix
Published: (2018) -
Exploring context awareness in mobile games
by: Fridvall, Freddie, et al.
Published: (2018) -
Robust Reinforcement Learning for Quadcopter Control
by: Bjarre, Lukas
Published: (2019)