Information selection plays a crucial role in how individuals navigate online content. While confirmation bias has been implicated in this phenomenon, its interaction with reinforcement learning dynamics and internal confidence signals remains poorly understood. Here, we examined how veracity judgments and confidence shape choices when probabilistic rewards are tied to different epistemic attributes of news headlines. Participants completed a three-phase paradigm that combined news classification, a probabilistic learning task with varying reward contingencies, and a final reevaluation phase. Using real and false headlines judged for veracity and confidence, we created personalized sets of stimulus categories that were later used in a two-armed bandit task. In different blocks of trials, reinforcement was probabilistically associated with either the perceived truthfulness or confidence of each item. Across all experimental phases, pupil dilation provided neurophysiological signatures of belief-related processing. At a behavioral level, participants showed higher accuracies and learning rates when rewards were contingent on their previous judgments of veracity, whereas performance was markedly reduced when reinforcement favored confidence, especially low-confidence options. Pupillometric data revealed predecisional modulations tied to subjective confidence, while computational modeling showed that participants relied on feature-based generalization when veracity predicted reward and shifted toward valence-sensitive updating when contingencies no longer matched their prior epistemic structure. Together, these results reveal how veracity and confidence jointly guide reinforcement-driven choices and modulate the flexibility of belief-related decisions. By integrating cognitive, computational, and physiological data, our study provides a mechanistic understanding of how prior beliefs shape learning in complex and misinformation-rich contexts.

Eye of the beholder: Pupillary response reflects how subjective prior beliefs shape reinforcement learning with fake news / Lozito, Silvana; Piga, Valentina; Lo Presti, Sara; Scuderi, Angelica; Doricchi, Fabrizio; Silvetti, Massimo; Lasaponara, Stefano. - In: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA. - ISSN 0027-8424. - 123:16(2026). [10.1073/pnas.2518776123]

Eye of the beholder: Pupillary response reflects how subjective prior beliefs shape reinforcement learning with fake news

Lozito, Silvana
Methodology
;
Piga, Valentina
Investigation
;
Lo Presti, Sara
Formal Analysis
;
Scuderi, Angelica
Formal Analysis
;
Doricchi, Fabrizio
Funding Acquisition
;
Lasaponara, Stefano
Conceptualization
2026

Abstract

Information selection plays a crucial role in how individuals navigate online content. While confirmation bias has been implicated in this phenomenon, its interaction with reinforcement learning dynamics and internal confidence signals remains poorly understood. Here, we examined how veracity judgments and confidence shape choices when probabilistic rewards are tied to different epistemic attributes of news headlines. Participants completed a three-phase paradigm that combined news classification, a probabilistic learning task with varying reward contingencies, and a final reevaluation phase. Using real and false headlines judged for veracity and confidence, we created personalized sets of stimulus categories that were later used in a two-armed bandit task. In different blocks of trials, reinforcement was probabilistically associated with either the perceived truthfulness or confidence of each item. Across all experimental phases, pupil dilation provided neurophysiological signatures of belief-related processing. At a behavioral level, participants showed higher accuracies and learning rates when rewards were contingent on their previous judgments of veracity, whereas performance was markedly reduced when reinforcement favored confidence, especially low-confidence options. Pupillometric data revealed predecisional modulations tied to subjective confidence, while computational modeling showed that participants relied on feature-based generalization when veracity predicted reward and shifted toward valence-sensitive updating when contingencies no longer matched their prior epistemic structure. Together, these results reveal how veracity and confidence jointly guide reinforcement-driven choices and modulate the flexibility of belief-related decisions. By integrating cognitive, computational, and physiological data, our study provides a mechanistic understanding of how prior beliefs shape learning in complex and misinformation-rich contexts.
2026
confirmation bias; reinforcement learning; belief updating; fake news; pupillometry
01 Pubblicazione su rivista::01a Articolo in rivista
Eye of the beholder: Pupillary response reflects how subjective prior beliefs shape reinforcement learning with fake news / Lozito, Silvana; Piga, Valentina; Lo Presti, Sara; Scuderi, Angelica; Doricchi, Fabrizio; Silvetti, Massimo; Lasaponara, Stefano. - In: PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA. - ISSN 0027-8424. - 123:16(2026). [10.1073/pnas.2518776123]
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1765767
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact