Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts

The behavior of Large Language Models (LLMs) when facing contextual information that conflicts with their internal parametric knowledge is inconsistent, with no generally accepted explanation for the expected outcome distribution. Recent work has identified in autoregressive transformer models a class of neurons – called entropy neurons – that produce a significant effect on the model output entropy while having an overall moderate impact on the ranking of the predicted tokens. In this paper, we investigate the preliminary claim that these neurons are involved in inhibiting context copying behavior in transformers by looking at their role in resolving conflicts between contextual and parametric information. We show that entropy neurons are responsible for suppressing context copying across a range of LLMs, and that ablating them leads to a significant change in the generation process. These results enhance our understanding of the internal dynamics of LLMs when handling conflicting information.

Domaines

Informatique et langage [cs.CL]

Fichier principal

2025.findings-emnlp.1116.pdf (1.91 Mo)

Origine	Fichiers éditeurs autorisés sur une archive ouverte
Licence	CC BY 4.0 - Attribution

Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-05375712

Soumis le : vendredi 21 novembre 2025-08:06:40

Dernière modification le : samedi 22 novembre 2025-03:20:39

Dates et versions

hal-05375712 , version 1 (21-11-2025)

Licence

CC BY 4.0 - Attribution

Identifiants

HAL Id : hal-05375712 , version 1
DOI : 10.18653/v1/2025.findings-emnlp.1116

Citer

Zineddine Tighidet, Andrea Mogini, Hedi Ben Younes, Jiali Mei, Patrick Gallinari, et al.. Context Copying Modulation: The Role of Entropy Neurons in Managing Parametric and Contextual Knowledge Conflicts. Findings of the Association for Computational Linguistics: EMNLP 2025, Nov 2025, Suzhou, China. pp.20469-20481, ⟨10.18653/v1/2025.findings-emnlp.1116⟩. ⟨hal-05375712⟩