Heuristic and optimal policy computations in the human brain during sequential decision-making

Description: Abstract of the article: Optimal decisions across extended time horizons require value calculations over multiple probabilistic future states. Humans may circumvent such complex computations by resorting to easy-to-compute heuristics that approximate optimal solutions. To probe the potential interplay between heuristic and optimal computations, we develop a novel sequential decision-making task, framed as virtual foraging in which participants had to avoid virtual starvation. Rewards depend only on final outcomes over five-trial blocks, necessitating planning over five sequential decisions and probabilistic outcomes. Here, we report model comparisons demonstrating that participants primarily rely on the best available heuristic but also use the normatively optimal policy. FMRI signals in medial prefrontal cortex (MPFC) relate to heuristic and optimal policies and associated choice uncertainties. Crucially, reaction times and dorsal MPFC activity scale with discrepancies between heuristic and optimal policies. Thus, sequential decision-making in humans may emerge from integration between heuristic and optimal policies, implemented by controllers in MPFC.

Citation guidelines

If you use the data from this collection please include the following persistent identifier in the text of your manuscript:

https://identifiers.org/neurovault.collection:3242

This will help to track the use of this data in the literature. In addition, consider also citing the paper related to this collection.

Field	Value
Compact Identifier	https://identifiers.org/neurovault.collection:3242
Add Date	Dec. 8, 2017, 10:50 a.m.
Uploaded by	christoph.w.korn
Contributors
Related article DOI	10.1038/s41467-017-02750-3
Related article authors	Christoph W. Korn and Dominik R. Bach