Optimization-based Approximate Dynamic Programming

Reinforcement learning algorithms hold promise in many complex domains, such as resource management and planning under uncertainty. Most reinforcement learning algorithms are iterative - they successively approximate the solution based on a set of samples and features. Although these iterative algor...

Full description

Bibliographic Details
Main Author: Petrik, Marek
Format: Others
Published: ScholarWorks@UMass Amherst 2010
Subjects:
Online Access:https://scholarworks.umass.edu/open_access_dissertations/308
https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1302&context=open_access_dissertations