Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs

Acting in domains where an agent must plan several steps ahead to achieve a goal can be a challenging task, especially if the agentʼs sensors provide only noisy or partial information. In this setting, Partially Observable Markov Decision Processes (POMDPs) provide a planning framework that optimall...

Full description

Bibliographic Details
Main Authors: Pineau, Joelle (Author), Doshi-Velez, Finale P (Contributor), Roy, Nicholas (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Elsevier, 2017-04-20T17:54:32Z.
Subjects:
Online Access:Get fulltext