Efficient Preference-based Reinforcement Learning
Common reinforcement learning algorithms assume access to a numeric feedback signal. The numeric feedback contains a high amount of information and can be maximized efficiently. However, the definition of a numeric feedback signal can be difficult in practise due to several limitations and badly def...
Main Author: | |
---|---|
Format: | Others |
Language: | en |
Published: |
2017
|
Online Access: | https://tuprints.ulb.tu-darmstadt.de/6952/1/ThesisColorMerged.pdf Wirth, Christian <http://tuprints.ulb.tu-darmstadt.de/view/person/Wirth=3AChristian=3A=3A.html> (2017): Efficient Preference-based Reinforcement Learning.Darmstadt, Technische Universität, [Ph.D. Thesis] |