Optimization-based approximate dynamic programming
Reinforcement learning algorithms hold promise in many complex domains, such as resource management and planning under uncertainty. Most reinforcement learning algorithms are iterative—they successively approximate the solution based on a set of samples and features. Although these iterative algorit...
Main Author: | |
---|---|
Language: | ENG |
Published: |
ScholarWorks@UMass Amherst
2010
|
Subjects: | |
Online Access: | https://scholarworks.umass.edu/dissertations/AAI3427564 |