Reinforcement learning: Dopamine ramps with fuzzy value estimates.

Whittington JCR.; Behrens TEJ.

Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Intranet
Accessibility
Cookies
Contact us
Log in

Reinforcement learning: Dopamine ramps with fuzzy value estimates.

Whittington JCR., Behrens TEJ.

A new study in reinforcement learning theory shows that extending the temporal difference algorithm to unbiased learning under state uncertainty explains the observed ramping behaviour of dopamine neurons.

Original publication

DOI

10.1016/j.cub.2022.01.070

Type

Journal article

Journal

Curr Biol

Publication Date

14/03/2022

Volume

Pages

R213 - R215