[Comp-neuro] World wide VVTNS series (6th season): Towards a general model of human reward-based learning | Maria Eckstein Google Deepmind | Wednesday, May 20, 2026, at 11:00 am ET

May 17, 2026

      [image: VVTNS.png]
https://www.wwtns.online
<https://streaklinks.com/A9c7PbbpKY7PxB6PaAJWGD3-/https%3A%2F%2Fwww.wwtns.onl...>
-
on twitter: wwtns@TheoreticalWide

You are cordially invited to the lecture

Maria Eckstein

Google Deepmind

 on the topic of

  Towards a general model of human reward-based learning

The lecture will be held on Zoom on May 20, 2026 at *11:00 am ET *
...
To receive the link: https://www.wwtns.online/register-page
...
*Abstract: *Traditional work in the study of human reward-based learning
involves designing an experimental task---often inspired by Reinforcement
Learning (RL) theory---and fits a small set of computational models---often
inspired by RL algorithms---to that dataset. For example, researchers often
model human behavior on bandit tasks using variants of Q-learning. While
this approach has been highly productive, leading to landmark discoveries
such as the dopamine reward prediction error hypothesis, it also has
limitations. This talk focuses on the lack of generalizability of such
models: Even if they closely fit behavior on the original task, models
derived from the one-task-one-model paradigm usually predict behavior on
other tasks quite poorly. I argue that this lack of generalizability is a
fundamental problem for the cognitive sciences: we intuitively expect our
models to be robust to superficial task differences, such as variations in
the number of choice options, reward probabilities, or the exact kind of
non-stationarity. I will propose potential solutions to this problem along
two dimensions: the behavioral dataset and the computational model.
Regarding computational models, I will introduce work in which we moved
beyond the limitations of hand-crafted one-off models by employing
flexible, data-driven methods. These methods allowed us to compare classes
of models instead of individual model instances, allowing us to cover the
space of possible models more exhaustively, and innovate cognitive
mechanisms very efficiently. For the behavioral dataset, we move from using
single learning tasks to a comprehensive task space that encompasses most
existing paradigms in the literature, while closing the gaps between them
in a near-continuous fashion. Our results suggest that more general models
in conjunction with broader datasets can pave the road toward increasingly
general models of human reward-based learning and decision making, and a
persistent departure from many aspects of RL theory.

*About VVTNS : Launched as the World Wide  Theoretical Neuroscience Seminar
(WWTNS) in November 2020 and renamed in homage to Carl van Vreeswijk in
Memoriam (April 20, 2022), Speakers have the occasion to talk about
theoretical aspects of their work which cannot be discussed in a setting
where the majority of the audience consists of experimentalists. The
seminars, **held on Wednesdays at 11 am ET,**  are 45-50 min long followed
by a discussion. The talks are recorded with authorization of the speaker
and are available to everybody on our YouTube channel.*

ᐧ
ᐧ

[Comp-neuro] World wide VVTNS series (6th season): Towards a general model of human reward-based learning | Maria Eckstein Google Deepmind | Wednesday, May 20, 2026, at 11:00 am ET

David Hansel