Learning Hierarchical Policies from Human Feedback
Robots are on the verge of becoming ubiquitous. In the form of affordable humanoid toy robots, autonomous cars, vacuum robots or quadrocopters, robots are becoming part of our everyday life. As of today, most of these robots still follow largely hard coded behavior routines. Constraining a robot’s b...
Main Author: | |
---|---|
Format: | Others |
Language: | English en |
Published: |
2016
|
Online Access: | https://tuprints.ulb.tu-darmstadt.de/5345/1/DefenseBW.pdf Daniel, Christian <http://tuprints.ulb.tu-darmstadt.de/view/person/Daniel=3AChristian=3A=3A.html> (2016): Learning Hierarchical Policies from Human Feedback.Darmstadt, Technische Universität Darmstadt, [Ph.D. Thesis] |