Discovering Implied Serial Order Through Model-Free and Model-Based Learning

Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learni...

Full description

Bibliographic Details
Main Authors: Greg Jensen, Herbert S. Terrace, Vincent P. Ferrera
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-08-01
Series:Frontiers in Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fnins.2019.00878/full
id doaj-b83cba251d3548eb9defdef20ef4a463
record_format Article
spelling doaj-b83cba251d3548eb9defdef20ef4a4632020-11-25T01:12:13ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2019-08-011310.3389/fnins.2019.00878465935Discovering Implied Serial Order Through Model-Free and Model-Based LearningGreg Jensen0Greg Jensen1Herbert S. Terrace2Herbert S. Terrace3Vincent P. Ferrera4Vincent P. Ferrera5Department of Psychology, Columbia University, New York, NY, United StatesDepartment of Neuroscience, Columbia University, New York, NY, United StatesDepartment of Psychology, Columbia University, New York, NY, United StatesDepartment of Psychiatry, Columbia University, New York, NY, United StatesDepartment of Neuroscience, Columbia University, New York, NY, United StatesDepartment of Psychiatry, Columbia University, New York, NY, United StatesHumans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature.https://www.frontiersin.org/article/10.3389/fnins.2019.00878/fullreinforcement learningmodel-free learningmodel-based learningcognitive mapstransitive inference
collection DOAJ
language English
format Article
sources DOAJ
author Greg Jensen
Greg Jensen
Herbert S. Terrace
Herbert S. Terrace
Vincent P. Ferrera
Vincent P. Ferrera
spellingShingle Greg Jensen
Greg Jensen
Herbert S. Terrace
Herbert S. Terrace
Vincent P. Ferrera
Vincent P. Ferrera
Discovering Implied Serial Order Through Model-Free and Model-Based Learning
Frontiers in Neuroscience
reinforcement learning
model-free learning
model-based learning
cognitive maps
transitive inference
author_facet Greg Jensen
Greg Jensen
Herbert S. Terrace
Herbert S. Terrace
Vincent P. Ferrera
Vincent P. Ferrera
author_sort Greg Jensen
title Discovering Implied Serial Order Through Model-Free and Model-Based Learning
title_short Discovering Implied Serial Order Through Model-Free and Model-Based Learning
title_full Discovering Implied Serial Order Through Model-Free and Model-Based Learning
title_fullStr Discovering Implied Serial Order Through Model-Free and Model-Based Learning
title_full_unstemmed Discovering Implied Serial Order Through Model-Free and Model-Based Learning
title_sort discovering implied serial order through model-free and model-based learning
publisher Frontiers Media S.A.
series Frontiers in Neuroscience
issn 1662-453X
publishDate 2019-08-01
description Humans and animals can learn to order a list of items without relying on explicit spatial or temporal cues. To do so, they appear to make use of transitivity, a property of all ordered sets. Here, we summarize relevant research on the transitive inference (TI) paradigm and its relationship to learning the underlying order of an arbitrary set of items. We compare six computational models of TI performance, three of which are model-free (Q-learning, Value Transfer, and REMERGE) and three of which are model-based (RL-Elo, Sequential Monte Carlo, and Betasort). Our goal is to assess the ability of these models to produce empirically observed features of TI behavior. Model-based approaches perform better under a wider range of scenarios, but no single model explains the full scope of behaviors reported in the TI literature.
topic reinforcement learning
model-free learning
model-based learning
cognitive maps
transitive inference
url https://www.frontiersin.org/article/10.3389/fnins.2019.00878/full
work_keys_str_mv AT gregjensen discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
AT gregjensen discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
AT herbertsterrace discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
AT herbertsterrace discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
AT vincentpferrera discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
AT vincentpferrera discoveringimpliedserialorderthroughmodelfreeandmodelbasedlearning
_version_ 1725167799136944128