Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.

High performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this directio...

Full description

Bibliographic Details
Main Authors:	Paul Richmond, Lars Buesing, Michele Giugliano, Eleni Vasilaki
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2011-05-01
Series:	PLoS ONE
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/21572529/?tool=EBI

id	doaj-78c4d4314a024d3c8c5eafcccbd88530
record_format	Article
spelling	doaj-78c4d4314a024d3c8c5eafcccbd885302021-03-03T19:53:29ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-05-0165e1853910.1371/journal.pone.0018539Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.Paul RichmondLars BuesingMichele GiuglianoEleni VasilakiHigh performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this direction may be worthwhile. In this work, we use GPU programming to simulate a two-layer network of Integrate-and-Fire neurons with varying degrees of recurrent connectivity and investigate its ability to learn a simplified navigation task using a policy-gradient learning rule stemming from Reinforcement Learning. The purpose of this paper is twofold. First, we want to support the use of GPUs in the field of Computational Neuroscience. Second, using GPU computing power, we investigate the conditions under which the said architecture and learning rule demonstrate best performance. Our work indicates that networks featuring strong Mexican-Hat-shaped recurrent connections in the top layer, where decision making is governed by the formation of a stable activity bump in the neural population (a "non-democratic" mechanism), achieve mediocre learning results at best. In absence of recurrent connections, where all neurons "vote" independently ("democratic") for a decision via population vector readout, the task is generally learned better and more robustly. Our study would have been extremely difficult on a desktop computer without the use of GPU programming. We present the routines developed for this purpose and show that a speed improvement of 5x up to 42x is provided versus optimised Python code. The higher speed is achieved when we exploit the parallelism of the GPU in the search of learning parameters. This suggests that efficient GPU programming can significantly reduce the time needed for simulating networks of spiking neurons, particularly when multiple parameter configurations are investigated.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/21572529/?tool=EBI
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Paul Richmond Lars Buesing Michele Giugliano Eleni Vasilaki
spellingShingle	Paul Richmond Lars Buesing Michele Giugliano Eleni Vasilaki Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations. PLoS ONE
author_facet	Paul Richmond Lars Buesing Michele Giugliano Eleni Vasilaki
author_sort	Paul Richmond
title	Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
title_short	Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
title_full	Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
title_fullStr	Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
title_full_unstemmed	Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.
title_sort	democratic population decisions result in robust policy-gradient learning: a parametric study with gpu simulations.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2011-05-01
description	High performance computing on the Graphics Processing Unit (GPU) is an emerging field driven by the promise of high computational power at a low cost. However, GPU programming is a non-trivial task and moreover architectural limitations raise the question of whether investing effort in this direction may be worthwhile. In this work, we use GPU programming to simulate a two-layer network of Integrate-and-Fire neurons with varying degrees of recurrent connectivity and investigate its ability to learn a simplified navigation task using a policy-gradient learning rule stemming from Reinforcement Learning. The purpose of this paper is twofold. First, we want to support the use of GPUs in the field of Computational Neuroscience. Second, using GPU computing power, we investigate the conditions under which the said architecture and learning rule demonstrate best performance. Our work indicates that networks featuring strong Mexican-Hat-shaped recurrent connections in the top layer, where decision making is governed by the formation of a stable activity bump in the neural population (a "non-democratic" mechanism), achieve mediocre learning results at best. In absence of recurrent connections, where all neurons "vote" independently ("democratic") for a decision via population vector readout, the task is generally learned better and more robustly. Our study would have been extremely difficult on a desktop computer without the use of GPU programming. We present the routines developed for this purpose and show that a speed improvement of 5x up to 42x is provided versus optimised Python code. The higher speed is achieved when we exploit the parallelism of the GPU in the search of learning parameters. This suggests that efficient GPU programming can significantly reduce the time needed for simulating networks of spiking neurons, particularly when multiple parameter configurations are investigated.
url	https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/21572529/?tool=EBI
work_keys_str_mv	AT paulrichmond democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT larsbuesing democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT michelegiugliano democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations AT elenivasilaki democraticpopulationdecisionsresultinrobustpolicygradientlearningaparametricstudywithgpusimulations
_version_	1714825187075031040

Democratic population decisions result in robust policy-gradient learning: a parametric study with GPU simulations.

Similar Items