Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

Sophie Bavard; Maël Lebreton; Mehdi Khamassi; Giorgio Coricelli; Stefano Palminteri

doi:10.1038/s41467-018-06781-2

Article Dans Une Revue Nature Communications Année : 2018

Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

, , (1) , ,

Sophie Bavard

Fonction : Auteur

Maël Lebreton

Fonction : Auteur
PersonId : 1286627
IdHAL : mael-lebreton
ORCID : 0000-0002-2071-4890

Mehdi Khamassi

Fonction : Auteur
PersonId : 186
IdHAL : mehdi-khamassi
ORCID : 0000-0002-2515-1046
IdRef : 12845072X

Institut des Systèmes Intelligents et de Robotique

Giorgio Coricelli

Fonction : Auteur

Stefano Palminteri

Fonction : Auteur

Résumé

In economics and perceptual decision-making contextual effects are well documented, where decision weights are adjusted as a function of the distribution of stimuli. Yet, in reinforcement learning literature whether and how contextual information pertaining to decision states is integrated in learning algorithms has received comparably little attention. Here, we investigate reinforcement learning behavior and its computational substrates in a task where we orthogonally manipulate outcome valence and magnitude, resulting in systematic variations in state-values. Model comparison indicates that subjects' behavior is best accounted for by an algorithm which includes both reference point-dependence and range-adaptation-two crucial features of state-dependent valuation. In addition, we find that state-dependent outcome valuation progressively emerges, is favored by increasing outcome information and correlated with explicit understanding of the task structure. Finally, our data clearly show that, while being locally adaptive (for instance in negative valence and small magnitude contexts), state-dependent valuation comes at the cost of seemingly irrational choices, when options are extrapolated out from their original contexts.

Domaines

Sciences du Vivant [q-bio]

Fichier principal

s41467-018-06781-2.pdf (1.02 Mo)

Origine	Publication financée par une institution

Gestionnaire HAL 2 Sorbonne Université : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-01927184

Soumis le : lundi 19 novembre 2018-16:43:40

Dernière modification le : mercredi 30 octobre 2024-13:28:03

Archivage à long terme le : mercredi 20 février 2019-16:08:42

Dates et versions

hal-01927184 , version 1 (19-11-2018)

Licence

Paternité

Identifiants

HAL Id : hal-01927184 , version 1
DOI : 10.1038/s41467-018-06781-2

Citer

Sophie Bavard, Maël Lebreton, Mehdi Khamassi, Giorgio Coricelli, Stefano Palminteri. Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences. Nature Communications, 2018, 9, pp.4503. ⟨10.1038/s41467-018-06781-2⟩. ⟨hal-01927184⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS ISIR SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_AMAC

47 Consultations

141 Téléchargements

Reference-point centering and range-adaptation enhance human reinforcement learning at the cost of irrational preferences

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager