Optimal learning: Computational procedures for Bayes -adaptive Markov decision processes
This dissertation considers a particular aspect of sequential decision making under uncertainty in which, at each stage, a decision-making agent operating in an uncertain world takes an action that elicits a reinforcement signal and causes the state of the world to change. Optimal learning is a patt...
Main Author: | |
---|---|
Language: | ENG |
Published: |
ScholarWorks@UMass Amherst
2002
|
Subjects: | |
Online Access: | https://scholarworks.umass.edu/dissertations/AAI3039353 |