Learning action-oriented models through active inference.

Converging theories suggest that organisms learn and exploit probabilistic models of their environment. However, it remains unclear how such models can be learned in practice. The open-ended complexity of natural environments means that it is generally infeasible for organisms to model their environ...

Full description

Bibliographic Details
Main Authors: Alexander Tschantz, Anil K Seth, Christopher L Buckley
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-04-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1007805
id doaj-5aa38e8e83da42829ecb0e9adda34b14
record_format Article
spelling doaj-5aa38e8e83da42829ecb0e9adda34b142021-04-21T15:15:11ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-04-01164e100780510.1371/journal.pcbi.1007805Learning action-oriented models through active inference.Alexander TschantzAnil K SethChristopher L BuckleyConverging theories suggest that organisms learn and exploit probabilistic models of their environment. However, it remains unclear how such models can be learned in practice. The open-ended complexity of natural environments means that it is generally infeasible for organisms to model their environment comprehensively. Alternatively, action-oriented models attempt to encode a parsimonious representation of adaptive agent-environment interactions. One approach to learning action-oriented models is to learn online in the presence of goal-directed behaviours. This constrains an agent to behaviourally relevant trajectories, reducing the diversity of the data a model need account for. Unfortunately, this approach can cause models to prematurely converge to sub-optimal solutions, through a process we refer to as a bad-bootstrap. Here, we exploit the normative framework of active inference to show that efficient action-oriented models can be learned by balancing goal-oriented and epistemic (information-seeking) behaviours in a principled manner. We illustrate our approach using a simple agent-based model of bacterial chemotaxis. We first demonstrate that learning via goal-directed behaviour indeed constrains models to behaviorally relevant aspects of the environment, but that this approach is prone to sub-optimal convergence. We then demonstrate that epistemic behaviours facilitate the construction of accurate and comprehensive models, but that these models are not tailored to any specific behavioural niche and are therefore less efficient in their use of data. Finally, we show that active inference agents learn models that are parsimonious, tailored to action, and which avoid bad bootstraps and sub-optimal convergence. Critically, our results indicate that models learned through active inference can support adaptive behaviour in spite of, and indeed because of, their departure from veridical representations of the environment. Our approach provides a principled method for learning adaptive models from limited interactions with an environment, highlighting a route to sample efficient learning algorithms.https://doi.org/10.1371/journal.pcbi.1007805
collection DOAJ
language English
format Article
sources DOAJ
author Alexander Tschantz
Anil K Seth
Christopher L Buckley
spellingShingle Alexander Tschantz
Anil K Seth
Christopher L Buckley
Learning action-oriented models through active inference.
PLoS Computational Biology
author_facet Alexander Tschantz
Anil K Seth
Christopher L Buckley
author_sort Alexander Tschantz
title Learning action-oriented models through active inference.
title_short Learning action-oriented models through active inference.
title_full Learning action-oriented models through active inference.
title_fullStr Learning action-oriented models through active inference.
title_full_unstemmed Learning action-oriented models through active inference.
title_sort learning action-oriented models through active inference.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2020-04-01
description Converging theories suggest that organisms learn and exploit probabilistic models of their environment. However, it remains unclear how such models can be learned in practice. The open-ended complexity of natural environments means that it is generally infeasible for organisms to model their environment comprehensively. Alternatively, action-oriented models attempt to encode a parsimonious representation of adaptive agent-environment interactions. One approach to learning action-oriented models is to learn online in the presence of goal-directed behaviours. This constrains an agent to behaviourally relevant trajectories, reducing the diversity of the data a model need account for. Unfortunately, this approach can cause models to prematurely converge to sub-optimal solutions, through a process we refer to as a bad-bootstrap. Here, we exploit the normative framework of active inference to show that efficient action-oriented models can be learned by balancing goal-oriented and epistemic (information-seeking) behaviours in a principled manner. We illustrate our approach using a simple agent-based model of bacterial chemotaxis. We first demonstrate that learning via goal-directed behaviour indeed constrains models to behaviorally relevant aspects of the environment, but that this approach is prone to sub-optimal convergence. We then demonstrate that epistemic behaviours facilitate the construction of accurate and comprehensive models, but that these models are not tailored to any specific behavioural niche and are therefore less efficient in their use of data. Finally, we show that active inference agents learn models that are parsimonious, tailored to action, and which avoid bad bootstraps and sub-optimal convergence. Critically, our results indicate that models learned through active inference can support adaptive behaviour in spite of, and indeed because of, their departure from veridical representations of the environment. Our approach provides a principled method for learning adaptive models from limited interactions with an environment, highlighting a route to sample efficient learning algorithms.
url https://doi.org/10.1371/journal.pcbi.1007805
work_keys_str_mv AT alexandertschantz learningactionorientedmodelsthroughactiveinference
AT anilkseth learningactionorientedmodelsthroughactiveinference
AT christopherlbuckley learningactionorientedmodelsthroughactiveinference
_version_ 1714667641322340352