PAC-optimal, Non-parametric Algorithms and Bounds for Exploration in Concurrent MDPs with Delayed Updates
<p>As the reinforcement learning community has shifted its focus from heuristic methods to methods that have performance guarantees, PAC-optimal exploration algorithms have received significant attention. Unfortunately, the majority of current PAC-optimal exploration algorithms are inapplicabl...
Main Author: | Pazis, Jason |
---|---|
Other Authors: | Parr, Ronald |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/10161/11334 |
Similar Items
-
PAC Optimal Exploration Algorithm Named RMAX-KNN
by: LI Chao, MEN Changqian, WANG Wenjian
Published: (2020-03-01) -
Assessing the ethical issues surrounding multi-disciplinary practices: the impact of MDPs on the profession of urban planning in Canada
by: Wandell, Robert
Published: (2007) -
PACS for the Developing World
by: Jeffrey B. Mendel, et al.
Published: (2015-11-01) -
A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
by: Tuyen P. Le, et al.
Published: (2018-01-01) -
Evaluating latest developments in PACS and their impact on radiology practices: A systematic literature review
by: Mona Alhajeri, et al.
Published: (2017-01-01)