Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments

Value-based decision making in complex environments, such as those with uncertain and volatile mapping of reward probabilities onto options, may engender computational strategies that are not necessarily optimal in terms of normative frameworks but may ensure effective learning and behavioral flexib...

Full description

Bibliographic Details
Main Authors: Anush Ghambaryan, Boris Gutkin, Vasily Klucharev, Etienne Koechlin
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-10-01
Series:Frontiers in Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fnins.2021.704728/full
id doaj-a6f6b186809c4a97bed79bc650bd231a
record_format Article
spelling doaj-a6f6b186809c4a97bed79bc650bd231a2021-10-01T07:34:51ZengFrontiers Media S.A.Frontiers in Neuroscience1662-453X2021-10-011510.3389/fnins.2021.704728704728Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic DevelopmentsAnush Ghambaryan0Anush Ghambaryan1Boris Gutkin2Boris Gutkin3Vasily Klucharev4Etienne Koechlin5Centre for Cognition and Decision Making, HSE University, Moscow, RussiaEcole Normale Supérieure, PSL Research University, Paris, FranceCentre for Cognition and Decision Making, HSE University, Moscow, RussiaEcole Normale Supérieure, PSL Research University, Paris, FranceCentre for Cognition and Decision Making, HSE University, Moscow, RussiaEcole Normale Supérieure, PSL Research University, Paris, FranceValue-based decision making in complex environments, such as those with uncertain and volatile mapping of reward probabilities onto options, may engender computational strategies that are not necessarily optimal in terms of normative frameworks but may ensure effective learning and behavioral flexibility in conditions of limited neural computational resources. In this article, we review a suboptimal strategy – additively combining reward magnitude and reward probability attributes of options for value-based decision making. In addition, we present computational intricacies of a recently developed model (named MIX model) representing an algorithmic implementation of the additive strategy in sequential decision-making with two options. We also discuss its opportunities; and conceptual, inferential, and generalization issues. Furthermore, we suggest future studies that will reveal the potential and serve the further development of the MIX model as a general model of value-based choice making.https://www.frontiersin.org/articles/10.3389/fnins.2021.704728/fulladditive strategyuncertain and volatile environmentnormalized utilitystate beliefvalue-based decision makingone-armed bandit task
collection DOAJ
language English
format Article
sources DOAJ
author Anush Ghambaryan
Anush Ghambaryan
Boris Gutkin
Boris Gutkin
Vasily Klucharev
Etienne Koechlin
spellingShingle Anush Ghambaryan
Anush Ghambaryan
Boris Gutkin
Boris Gutkin
Vasily Klucharev
Etienne Koechlin
Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
Frontiers in Neuroscience
additive strategy
uncertain and volatile environment
normalized utility
state belief
value-based decision making
one-armed bandit task
author_facet Anush Ghambaryan
Anush Ghambaryan
Boris Gutkin
Boris Gutkin
Vasily Klucharev
Etienne Koechlin
author_sort Anush Ghambaryan
title Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
title_short Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
title_full Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
title_fullStr Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
title_full_unstemmed Additively Combining Utilities and Beliefs: Research Gaps and Algorithmic Developments
title_sort additively combining utilities and beliefs: research gaps and algorithmic developments
publisher Frontiers Media S.A.
series Frontiers in Neuroscience
issn 1662-453X
publishDate 2021-10-01
description Value-based decision making in complex environments, such as those with uncertain and volatile mapping of reward probabilities onto options, may engender computational strategies that are not necessarily optimal in terms of normative frameworks but may ensure effective learning and behavioral flexibility in conditions of limited neural computational resources. In this article, we review a suboptimal strategy – additively combining reward magnitude and reward probability attributes of options for value-based decision making. In addition, we present computational intricacies of a recently developed model (named MIX model) representing an algorithmic implementation of the additive strategy in sequential decision-making with two options. We also discuss its opportunities; and conceptual, inferential, and generalization issues. Furthermore, we suggest future studies that will reveal the potential and serve the further development of the MIX model as a general model of value-based choice making.
topic additive strategy
uncertain and volatile environment
normalized utility
state belief
value-based decision making
one-armed bandit task
url https://www.frontiersin.org/articles/10.3389/fnins.2021.704728/full
work_keys_str_mv AT anushghambaryan additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
AT anushghambaryan additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
AT borisgutkin additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
AT borisgutkin additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
AT vasilyklucharev additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
AT etiennekoechlin additivelycombiningutilitiesandbeliefsresearchgapsandalgorithmicdevelopments
_version_ 1716862032953737216