Efficient Preference-based Reinforcement Learning

Common reinforcement learning algorithms assume access to a numeric feedback signal. The numeric feedback contains a high amount of information and can be maximized efficiently. However, the definition of a numeric feedback signal can be difficult in practise due to several limitations and badly def...

Full description

Bibliographic Details
Main Author: Wirth, Christian
Format: Others
Language:en
Published: 2017
Online Access:https://tuprints.ulb.tu-darmstadt.de/6952/1/ThesisColorMerged.pdf
Wirth, Christian <http://tuprints.ulb.tu-darmstadt.de/view/person/Wirth=3AChristian=3A=3A.html> (2017): Efficient Preference-based Reinforcement Learning.Darmstadt, Technische Universität, [Ph.D. Thesis]