Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Deciding between stimuli requires combining their learned value with one's sensory confidence. We trained mice in a visual task that probes this combination. Mouse choices reflected not only present confidence and past rewards but also past confidence. Their behavior conformed to a model that combines signal detection with reinforcement learning. In the model, the predicted value of the chosen option is the product of sensory confidence and learned value. We found precise correlates of this variable in the pre-outcome activity of midbrain dopamine neurons and of medial prefrontal cortical neurons. However, only the latter played a causal role: inactivating medial prefrontal cortex before outcome strengthened learning from the outcome. Dopamine neurons played a causal role only after outcome, when they encoded reward prediction errors graded by confidence, influencing subsequent choices. These results reveal neural signals that combine reward value with sensory confidence and guide subsequent learning.

Original publication

DOI

10.1016/j.neuron.2019.11.018

Type

Journal article

Journal

Neuron

Publication Date

19/02/2020

Volume

105

Pages

700 - 711.e6

Keywords

Calcium imaging, Decision confidence, Electrophysiology, Mice, Optogenetics, Psychophysics, Reinforcement learning, Animals, Choice Behavior, Dopaminergic Neurons, Learning, Male, Mice, Mice, Inbred C57BL, Mice, Transgenic, Optogenetics, Prefrontal Cortex, Reward