Convergence Results for Some Temporal Difference Methods Based on Least Squares
We consider finite-state Markov decision processes, and prove convergence and rate of convergence results for certain least squares policy evaluation algorithms of the type known as LSPE(λ). These are temporal difference methods for constructing a linear function approximation of the cost func...
Main Authors: | Yu, Huizhen (Contributor), Bertsekas, Dimitri P. (Contributor) |
---|---|
Other Authors: | Massachusetts Institute of Technology. Laboratory for Information and Decision Systems (Contributor) |
Format: | Article |
Language: | English |
Published: | Institute of Electrical and Electronics Engineers, 2012-10-18T19:03:35Z. |
Similar Items
- Least Squares Temporal Difference Methods: An Analysis under General Conditions
  by: Yu, Huizhen
  Published: (2013)
- A unified framework for temporal difference methods
  by: Bertsekas, Dimitri P.
  Published: (2010)
- Pathologies of Temporal Difference Methods in Approximate Dynamic Programming
  by: Bertsekas, Dimitri P.
  Published: (2011)
- ON THE CONVERGENCE OF THE LEAST SQUARE METHOD IN CASE OF NON-UNIFORM GRIDS
  by: M. S. Sultanakhmedov
  Published: (2019-11-01)
- On Investigation of the convergence by least square method for solving Fredholm integral
  by: Mohammad Hasso, et al.
  Published: (2008-06-01)