Discovering Implied Serial Order Through Model-Free and Model-Based Learning
Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learni...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2019-08-01
|
Series: | Frontiers in Neuroscience |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fnins.2019.00878/full |
id |
doaj-b83cba251d3548eb9defdef20ef4a463 |
---|---|
record_format |
Article |
spelling |
doaj-b83cba251d3548eb9defdef20ef4a4632020-11-25T01:12:13ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2019-08-011310.3389/fnins.2019.00878465935Discovering Implied Serial Order Through Model-Free and Model-Based LearningGreg Jensen0Greg Jensen1Herbert S. Terrace2Herbert S. Terrace3Vincent P. Ferrera4Vincent P. Ferrera5Department of Psychology, Columbia University, New York, NY, United StatesDepartment of Neuroscience, Columbia University, New York, NY, United StatesDepartment of Psychology, Columbia University, New York, NY, United StatesDepartment of Psychiatry, Columbia University, New York, NY, United StatesDepartment of Neuroscience, Columbia University, New York, NY, United StatesDepartment of Psychiatry, Columbia University, New York, NY, United StatesHumans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature.https://www.frontiersin.org/article/10.3389/fnins.2019.00878/fullreinforcement learningmodel-free learningmodel-based learningcognitive mapstransitive inference |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Greg Jensen Greg Jensen Herbert S. Terrace Herbert S. Terrace Vincent P. Ferrera Vincent P. Ferrera |
spellingShingle |
Greg Jensen Greg Jensen Herbert S. Terrace Herbert S. Terrace Vincent P. Ferrera Vincent P. Ferrera Discovering Implied Serial Order Through Model-Free and Model-Based Learning Frontiers in Neuroscience reinforcement learning model-free learning model-based learning cognitive maps transitive inference |
author_facet |
Greg Jensen Greg Jensen Herbert S. Terrace Herbert S. Terrace Vincent P. Ferrera Vincent P. Ferrera |
author_sort |
Greg Jensen |
title |
Discovering Implied Serial Order Through Model-Free and Model-Based Learning |
title_short |
Discovering Implied Serial Order Through Model-Free and Model-Based Learning |
title_full |
Discovering Implied Serial Order Through Model-Free and Model-Based Learning |
title_fullStr |
Discovering Implied Serial Order Through Model-Free and Model-Based Learning |
title_full_unstemmed |
Discovering Implied Serial Order Through Model-Free and Model-Based Learning |
title_sort |
discovering implied serial order through model-free and model-based learning |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Neuroscience |
issn |
1662-453X |
publishDate |
2019-08-01 |
description |
Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature. |
topic |
reinforcement learning model-free learning model-based learning cognitive maps transitive inference |
url |
https://www.frontiersin.org/article/10.3389/fnins.2019.00878/full |
work_keys_str_mv |
AT gregjensen discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning AT gregjensen discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning AT herbertsterrace discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning AT herbertsterrace discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning AT vincentpferrera discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning AT vincentpferrera discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning |
_version_ |
1725167799136944128 |