Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.
Changes of synaptic connections between neurons are thought to be the physiological basis of learning. These changes can be gated by neuromodulators that encode the presence of reward. We study a family of reward-modulated synaptic learning rules for spiking neurons on a learning task in continuous...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2009-12-01 |
Series: | PLoS Computational Biology |
Online Access: | http://europepmc.org/articles/PMC2778872?pdf=render |