Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders

Mathias Vast; Basile van Cooten; Laure Soulier; Benjamin Piwowarski

doi:10.1145/3664190.3672528

Communication Dans Un Congrès Année : 2024

Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders

(1, 2) , (3) , (1) , (1, 4)

1
2
3
4

Mathias Vast

Fonction : Auteur
PersonId : 1366751

Machine Learning and Information Access

Institut des Systèmes Intelligents et de Robotique

Basile van Cooten

Fonction : Auteur
PersonId : 1406573
ORCID : 0009-0002-0234-917X

Sinequa

Laure Soulier

Fonction : Auteur
PersonId : 8070
IdHAL : soulierl
ORCID : 0000-0001-9827-7400
IdRef : 189293683

Machine Learning and Information Access

Benjamin Piwowarski

Fonction : Auteur
PersonId : 9362
IdHAL : benjamin-piwowarski
ORCID : 0000-0001-6792-3262
IdRef : 226846601

Machine Learning and Information Access

Centre National de la Recherche Scientifique

Résumé

With the recent addition of Retrieval-Augmented Generation (RAG), the scope and importance of Information Retrieval (IR) has expanded. As a result, the importance of a deeper understanding of IR models also increases. However, interpretability in IR remains under-explored, especially when it comes to the models' inner mechanisms. In this paper, we explore the possibility of adapting Integrated Gradient-based methods in an IR context to identify the role of individual neurons within the model. In particular, we provide new insights into the role of what we call "relevance" neurons, as well as how they deal with unseen data. Finally, we carry out an in-depth pruning study to validate our findings.

Mots clés

Information Retrieval Cross-Encoders Explainability Integrated Gradients

Domaines

Recherche d'information [cs.IR]

Mathias Vast : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-04668348

Soumis le : mardi 6 août 2024-14:31:15

Dernière modification le : jeudi 12 décembre 2024-03:47:36

Dates et versions

hal-04668348 , version 1 (06-08-2024)

Identifiants

HAL Id : hal-04668348 , version 1
ARXIV : 2406.19309
DOI : 10.1145/3664190.3672528

Citer

Mathias Vast, Basile van Cooten, Laure Soulier, Benjamin Piwowarski. Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders. ICTIR '24: The 2024 ACM SIGIR International Conference on the Theory of Information Retrieval, Jul 2024, Washington DC, United States. pp.133-143, ⟨10.1145/3664190.3672528⟩. ⟨hal-04668348⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS ISIR SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_MLIA

33 Consultations

0 Téléchargements

Which Neurons Matter in IR? Applying Integrated Gradients-based Methods to Understand Cross-Encoders

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager