Temporal-difference reinforcement learning with distributed representations.

Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the beli...

Full description

Bibliographic Details
Main Authors: Zeb Kurth-Nelson, A David Redish
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2009-10-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19841749/pdf/?tool=EBI