Balancing exploration and exploitation with information and randomization

RC Wilson, E Bonawitz, VD Costa, RB Ebitz - Current opinion in behavioral …, 2021 - Elsevier
Explore-exploit decisions require us to trade off the benefits of exploring unknown options to
learn more about them, with exploiting known options, for immediate reward. Such decisions …

A primer on foraging and the explore/exploit trade-off for psychiatry research

MA Addicott, JM Pearson, MM Sweitzer… - …, 2017 - nature.com
Foraging is a fundamental behavior, and many types of animals appear to have solved
foraging problems using a shared set of mechanisms. Perhaps the most common foraging …

Computational noise in reward-guided learning drives behavioral variability in volatile environments

C Findling, V Skvortsova, R Dromnelle… - Nature …, 2019 - nature.com
When learning the value of actions in volatile environments, humans often make seemingly
irrational decisions that fail to maximize expected value. We reasoned that these 'non …

When the brain takes a break: A model-based analysis of mind wandering

M Mittner, W Boekel, AM Tucker, BM Turner… - Journal of …, 2014 - jneurosci.org
Mind wandering is an ubiquitous phenomenon in everyday life. In the cognitive
neurosciences, mind wandering has been associated with several distinct neural processes …

[HTML][HTML] Rostrolateral prefrontal cortex and individual differences in uncertainty-driven exploration

D Badre, BB Doll, NM Long, MJ Frank - Neuron, 2012 - cell.com
How do individuals decide to act based on a rewarding status quo versus an unexplored
choice that might yield a better outcome? Recent evidence suggests that individuals may …

[HTML][HTML] Computation noise in human learning and decision-making: origin, impact, function

C Findling, V Wyart - Current Opinion in Behavioral Sciences, 2021 - Elsevier
Highlights•Computation noise drives human decision variability under uncertainty.•
Computation noise sets cognitive constraints on information processing.•Computation noise …

Pharmacological fingerprints of contextual uncertainty

L Marshall, C Mathys, D Ruge, AO De Berker… - PLoS …, 2016 - journals.plos.org
Successful interaction with the environment requires flexible updating of our beliefs about
the world. By estimating the likelihood of future events, it is possible to prepare appropriate …

Acetylcholine and noradrenaline enhance foraging optimality in humans

N Doren, HK Chung, M Grueschow… - Proceedings of the …, 2023 - pnas.org
Foraging theory prescribes when optimal foragers should leave the current option for more
rewarding alternatives. Actual foragers often exploit options longer than prescribed by the …

Human complex exploration strategies are enriched by noradrenaline-modulated heuristics

M Dubois, J Habicht, J Michely, R Moran, RJ Dolan… - Elife, 2021 - elifesciences.org
An exploration-exploitation trade-off, the arbitration between sampling a lesser-known
against a known rich option, is thought to be solved using computationally demanding …

Catecholaminergic regulation of learning rate in a dynamic environment

M Jepma, PR Murphy, MR Nassar… - PLoS computational …, 2016 - journals.plos.org
Adaptive behavior in a changing world requires flexibly adapting one's rate of learning to the
rate of environmental change. Recent studies have examined the computational …